REPORT  DOCUMENTATION  PAGE 

Form  Approved 

OMB  No.  0704-0188 

Public  reporting  burden  for  this  collection  of  mforrreijon  is  estimated  to  average  i  hour  per  response,  mdudmg  the  time  for  reviewing  instructions,  searching  data  sources. 

gathering  and  maintaining  (he  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection 
or  information,  including  suggestions  for  reducing  this  burden  to  Washington  Headquarters  Service.  Directorate  Tor  Information  Operations  and  Reports, 

1215  Jefferson  Davis  Highway,  Suile  1204.  Arlington.  VA  22202-4302,  and  to  the  Office  of  Management  and  Budget, 

Paperwork  Reduction  Project  (0704-0188)  Washington,  DC  20503, 

PLEASE  DO  MOT  RETURN  YOUR  FORM  TO  THE  ABOVE  ADDRESS* 

1.  REPORT  DATE  (DD-MM-YYYY) 

30-06-2007 

2.  REPORT  TYPE 

Final  Technical  Report 

3.  DATES  COVERED  (From  -  To) 

1  Apr  2004 -31  Mar  2007 

4.  TITLE  AND  SUBTITLE 

Cooperation  and  Consensus  Seeking  for  Teams  of  Unmanned  Air 
Vehicles 


Sa.  CONTRACT  NUMBER 


5b.  GRANT  NUMBER 

FA9550-04-1-0209 


5c.  PROGRAM  ELEMENT  NUMBER 


6.  AUTHOR(SJ 

McLain,  Tim  (mclain@byu.edu) 
Beard,  Randy  (beard@byu.edu) 


5d.  PROJECT  NUMBER 


5e.  TASK  NUMBER 


5f.  WORK  UNIT  NUMBER 


7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESSES} 
Brigham  Young  University 
Provo,  UT  84602 


8.  PERFORMING  ORGANIZATION 
REPORT  NUMBER 


9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

Air  Force  Office  of  Scientific  Research  (AFOSR) 

875  N.  Arlington  St.,  Rm.  3112 
Arlington,  VA  22203  .  f  .  . 

_ ili _ 

12.  DISTRIBUTION  AVAILABILITY  STATEMENT 

DISTRIBUTION  A:  Approved  for  public  release;  distribution  unlimited. 


10.  SPONSOR/MONITOR'S  ACRONYM(S) 

AFOSR 


11.  SPONSORING/MONITORING 
AGENCY  REPORT  NUMBER 


AFRL-SR-AR'TR-07-0282 


13.  SUPPLEMENTARY  NOTES 


14.  ABSTRACT 

This  report  describes  research  in  the  area  of  cooperative  control  of  unmanned  air  vehicles  carried  out  at  Brigham  Young 
University  with  the  support  of  AFOSR  during  the  period  from  April  2004  to  March  2007.  Research  efforts  have  centered 
on  the  development  of  cooperative  control  methods  and  consensus-seeking  strategies  for  multi-vehicle  teams  with 
experimental  validation  of  those  methods. 


15.  SUBJECT  TERMS 

cooperative  control,  coordination,  consensus,  unmanned  air  vehicles,  UAVs,  unmanned  aircraft  systems,  UAS 


16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION  OF 
ABSTRACT 

13.  NUMBER 

OF  PAGES 

19a.  NAME  OF  RESPONSIBLE  PERSON 

Lt  Col  Scott  R.  Wells 

a.  REPORT 

Unclassified 

b.  ABSTRACT 

Unclassified 

c.  THIS  PAGE 

Unclassified 

Unclassified 

67 

19b.  TELEPONE  NUMBER  ( Include  area  code) 

(703) 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSl-Std  239-16 


COOPERATION  AND  CONSENSUS  SEEKING 
FOR  TEAMS  OF  UNMANNED  AIR  VEHICLES 


Final  Report 

AFOSR  GRANT  NO.  FA9550-04- 1-0209 


Principal  Investigators 

Timothy  W.  McLain 
Department  of  Mechanical  Engineering 
Brigham  Young  University 
Provo,  Utah  84604  USA 
voice:  (801)  422-6537 
fax:  (801)  422-0516 
email:  mclam@byu.edu 


Randal  W.  Beard 

Department  of  Electrical  and  Computer  Engineering 
Brigham  Young  University 
Provo,  Utah  84604  USA 
voice:  (801)  422-8392 
fax:  (801)  422-0201 
email:  beard@byu.edu 


BYU 

Magicc  Lab 


Abstract 

This  report  describes  research  in  the  area  of  cooperative  control  of  unmanned  air  vehicles  carried  out  at 
Brigham  Young  University  with  the  support  of  APOSR  during  the  period  from  April  2004  to  March  2007, 
Research  efforts  have  centered  on  the  development  of  cooperative  control  methods  and  consensus-seeking 
strategies  for  multi- vehicle  teams  with  experimental  validation  of  those  methods. 


Objectives 

Our  research  focused  on  cooperative  control  of  unmanned  air  vehicles  (UAVs).  Under  this  AFOSR 
supported  effort,  we  have  sought  to  accomplish  three  primary  objectives;  1)  extend  our  current  theoretical 
approach  to  encompass  broader  classes  of  cooperative  control  problems,  2}  develop  strategies  for  building 
consensus  among  a  team  of  vehicles  with  inconsistent  sensory  information,  and  3)  experimentally  demonstrate 
the  effectiveness  of  our  cooperative  control  strategies  on  a  team  of  small  UAYs. 

The  technical  content  of  this  report  is  divided  into  four  main  sections.  Section  1  discusses  an  application  of 
our  coordination  variable  approach  to  a  new  cooperative  surveillance  problem.  In  the  cooperative  surveillance 
problem,  a  team  of  UAVs  is  used  to  monitor  the  periphery  of  a  region  of  interest.  The  periphery  can  change 
in  time  requiring  cooperation  among  the  UAYs  to  utilize  resources  effectively.  The  region  of  interest  could 
be  a  national  border,  a  spreading  forest  fire,  a  growing  toxic  plume,  or  a  group  of  fleeing  enemy  combatants. 

Our  recent  work  on  the  development  of  consensus  seeking  strategies  for  teams  of  UAYs  is  detailed  in 
Sections  2  and  3.  Specifically,  we  have  developed  a  method  for  dealing  with  relative  uncertainty  between 
agents.  We  have  shown  that  consensus  algorithms  can  be  developed  that  are  stable  in  the  presence  of 
communication  and  sensor  noise.  In  addition,  we  have  extended  well-known  results  on  average  consensus  to 
account  for  more  reasonable  communication  architectures. 

We  have  conducted  numerous  multiple  UAY  experiments.  We  have  demonstrated  several  cooperative 
timing  scenarios  using  three  UAVs  as  shown  in  Section  4.  These  experiments  represent  significant  progress  in 
two  ways:  First,  they  demonstrate  the  practical  validity  of  our  theoretical  cooperative  control  developments. 
Second,  they  demonstrate  a  significant  increase  in  the  robustness  and  reliability  of  the  multiple  UAV  systems 
in  use.  Flying  multiple  UAVs  has  directed  us  towards  important  research  problems  of  practical  interest. 
Solving  some  of  these  important  problems  has  enabled  increased  reliability  in  multiple  UAY  operations. 
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1  Perimeter  Surveillance 

Perimeter  surveillance  algorithms  form  the  basis  for  effective  monitoring  in  a  number  of  applications  including 
monitoring  oil  spills  [1],  contaminant  clouds  [2],  algae  bloom  [3],  forest  fires  [4,  5],  and  border  security  [6,  7]. 
The  literature  in  this  area  can  roughly  be  decomposed  into  two  main  groups:  sensor  technology  used  for 
perimeter  detection;  and  algorithms  used  to  gather  data  along  the  perimeter  effectively 

Sensors  that  have  been  investigated  for  small  fixed-border  scenarios,  such  as  warehouse  surveillance, 
include  cameras  [8],  ultrasound  [9],  and  radar  [10].  In  [8],  the  authors  discuss  algorithms  that  use  image  data 
from  multiple  cameras  to  determine  a  perimeter  breach.  Peralta  [9]  uses  a  chain  of  ultrasound  sensors  with  a 
simple  detection  scheme  to  identify  border  crossings.  Research  has  also  been  done  using  existing  airport  radar 
equipment  to  identify  when  people  or  vehicles  come  too  close  to  runways  [10].  For  spill  monitoring  and  other 
dynamic  perimeter  scenarios,  surveillance  vehicles  are  equipped  with  chemical  concentration  sensors  [11], 
infrared  cameras  [5],  or  standard  optical  cameras  [1]. 

We  are  particularly  interested  in  borders  that  are  of  unknown  shape  and  size  and  possibly  changing  in 
time  such  as  would  be  encountered  in  a  forest  fire  or  chemical  spill  monitoring  scenario*  Additionally,  we 
do  not  exclude  large  borders  where  communication  range  will  limit  the  possible  interaction  of  the  team.  We 
will  assume  that  UAV  agents  have  the  proper  sensor  suite  to  detect  changes  in  the  perimeter  and  track  the 
edge  of  the  perimeter*  We  will  not  focus  on  the  necessary  sensor  technology  to  do  this,  but  rather  on  the 
algorithms  that  will  allow  a  team  of  agents  to  monitor  a  perimeter  in  a  decentralized  fashion.  Perimeter 
surveillance  using  multiple  UAVs  has  the  advantage  of  operating  in  a  wide  variety  of  circumstances  such  as 
changing  perimeters  (spill  monitoring,  forest  fire  surveillance)  or  very  large  perimeters  (border  patrol)*  In 
addition,  small  UAVs  are  relatively  inexpensive  and  can  be  rapidly  deployed. 

A  number  of  researchers  have  investigated  similar  problems  of  monitoring/tracking  changing  perimeters 
with  autonomous  vehicles.  The  MDARS  project  [12]  is  a  joint  effort  between  the  Army  and  Navy  that 
networks  multiple  ground  robots  to  cooperatively  monitor  a  fixed  perimeter  near  critical  storage  facilities* 
A  team  of  robots  are  equipped  with  coarse  obstacle  detection  sensors  and  a  high  precision  narrow  field  of 
view  sensor  to  find  and  track  objects  that  have  breached  the  perimeter  [7].  The  entire  team  of  vehicles 
communicates  to  a  central  location  where  sensor  data  is  fused  and  waypoint  commands  are  issued  [6] .  Our 
work  differs  from  MDARS  in  that  we  do  not  require  team  agents  to  be  in  constant  communication  with  a 
centralized  controller;  rather,  agents  are  frequently  outside  of  the  communication  range  of  the  other  team 
members  and  must  monitor  the  perimeter  in  a  decentralized  manner* 

Teams  of  unmanned  water  vehicles  have  been  proposed  as  a  way  to  track  algae  bloom  and  oil  spills, 
Bertozzie  et  al*  in  [13]  present  an  algorithm  for  monitoring  a  perimeter  with  multiple  agents  when  each  is 
equipped  with  a  concentration  sensor.  When  the  sensor  detects  the  presence  of  the  chemical,  the  vehicle 
turns  in  one  direction;  in  the  absence  of  chemical  detection,  the  vehicle  turns  in  the  opposite  direction.  In 
this  way,  an  agent  weaves  around  the  perimeter  of  the  spill  while  communicating  the  perimeter  crossing 
points  to  form  a  complete  picture  of  the  perimeter.  A  simple  spacing  law  adjusts  the  speed  of  the  vehicles 
to  allow  the  team  to  spread  out  along  the  perimeter.  The  algorithm  has  been  shown  to  work  in  hardware 
testbed  experiments  with  virtual  perimeters  [14]* 

Clark  and  Fierro  propose  a  similar  method  for  oil  spill  perimeter  tracking  using  multiple  vehicles  [1].  A 
fleet  of  vehicles  is  deployed  and  will  search  the  region  and  communicate  to  team  members  when  the  perimeter 
is  located.  Agents  will  approach  the  perimeter  and  begin  to  track  it  in  a  predetermined  direction.  Spacing 
of  the  vehicles  is  accomplished  by  adjusting  linear  velocity  Hardware  experiments  using  a  camera  sensor  on 
wheeled  robots  is  shown  to  validate  the  algorithm.  In  both  this  approach  and  the  one  proposed  by  Bertozzi 
et  al  [13],  neither  the  efficiency  nor  the  convergence  of  the  algorithms  are  shown  analytically.  In  addition, 
neither  explicitly  address  the  problem  of  limited  communication  range. 

Susea,  Martinez  and  Bullo  address  the  issue  of  approximating  a  changing  border  with  a  set  of  interpolation 
points  [15].  As  the  team  agents  traverse  the  perimeter,  they  update  the  points  that  describe  the  perimeter 
to  best  fit  a  polygon  to  the  shape  of  the  perimeter.  Their  algorithm  is  shown  to  converge  and  relies  only  on 
communication  between  immediate  neighbors. 

In  this  section,  we  will  present  an  algorithm  for  perimeter  surveillance  that:  (1)  is  completely  decentral- 
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ized,  (2)  is  provable  convergent  to  the  optimal  behavior  in  finite-time,  (3)  explicitly  accounts  for  communi¬ 
cation  range  limitations,  and  (4)  allows  for  changing  perimeters.  The  primary  advantages  to  a  decentralized 
approach  are  scalability  and  inherent  system  robustness.  Since  agents  only  make  decisions  based  on  neigh¬ 
bor  interactions,  the  required  communication  bandwidth  and  computation  is  fixed  irrespective  of  the  total 
number  of  agents  on  the  team.  Decentralization  is  inherently  robust  since  each  agent  makes  decisions  with 
its  available  information  without  a  need  to  receive  directions  from  a  central  location.  This  eliminates  single 
points  of  failure  and  allows  a  system  to  adapt  naturally  to  changes  in  team  size.  Agents  can  be  inserted  and 
deleted  from  the  team  at  any  time  and  the  system  will  adjust  since  each  agent  will  maneuver  to  find  its  new 
neighbors.  This  allows  agents  to  leave  the  team  for  high  priority  tasks,  such  as  following  a  perimeter  breach, 
or  in  case  of  accident  or  refueling. 

in  addition  to  being  fully  decentralized,  our  approach  is  optimal  at  steady-state  and  has  finite-time 
convergence.  Additionally,  our  approach  requires  very  little  communication  bandwidth  and  accounts  for 
UAV  kinematic  constraints.  The  algorithm  is  limited  to  constant  velocity  vehicles  that  travel  along  the 
border  and  due  to  its  decentralized  nature,  any  global  information  that  may  be  available  is  not  exploited. 
For  missions  where  robustness  is  valued  more  than  efficiency,  our  approach  is  a  natural  fit.  Since  it  guarantees 
optimality  in  steady-state  and  finite-time  convergence,  only  missions  that  have  strict  efficiency  requirements 
would  not  be  well -suited. 

The  perimeter  surveillance  problem  is  posed  in  Sections  LI  and  1.2.  Section  L3  presents  our  solution 
using  a  coordination  variable  [16]  approach  and  compares  it  to  both  averaging  and  centralized  solutions. 
The  method  is  extended  to  changing  perimeters  in  Section  1.4  and  to  account  for  constrained  UAV  turning 
radius  in  Section  1.5.  Simulation  and  hardware  results  are  presented  in  Section  1.6.  Finally,  Section  1.8 
gives  our  conclusions. 

1.1  Problem  Formulation 

The  objective  of  the  cooperative  perimeter  surveillance  problem  is  to  cooperatively  gather  information  about 
the  state  of  the  perimeter  and  transmit  that  data  to  a  central  base  station  with  as  little  delay  and  at  the 
highest  rate  possible.  There  are  a  number  of  factors  that  complicate  the  perimeter  surveillance  problem 
including: 

1 .  Perimeter  topology 

2.  Communication  constraints 

3.  Team  logistics 

4.  UAV  capability. 

Perimeter  Topology.  A  perimeter  may  be  static,  such  as  a  well-defined  border,  or  changing  in  time, 
such  as  a  chemical  spill  or  forest  fire.  A  perimeter  can  be  composed  of  a  web  of  segments  and  nodes  that  must 
be  monitored,  such  as  a  set  of  city  streets  or  a  network  of  paths  iri  the  mountains,  although  we  do  require 
the  graph  representing  the  perimeter  to  be  strongly  connected,  i.e.  not  disconnected.  An  area  surveillance 
problem  can  sometimes  be  posed  as  a  perimeter  surveillance  problem  by  constructing  a  patii  that  covers  the 
full  area  {e.g.  zamboni  pattern)  and  then  monitoring  that  path  as  a  perimeter.  The  perimeter  location  need 
not  be  known  a  priori ,  but  when  this  is  the  case  we  assume  that  the  UAVs  have  the  sensor  capacity  to  detect 
and  follow  the  perimeter  autonomously. 

Communication  Constraints.  Small,  inexpensive  UAVs  often  have  limited  communication  bandwidth 
and  short  communication  range.  In  scenarios  where  the  perimeter  is  very  targe  or  terrain  causes  lineof-sight 
problems  agents  may  frequently  be  out  of  communication  range  of  the  base  station  and  neighboring  UAVs. 
Additionally,  the  gathered  data  may  require  significant  time  to  transmit  when  a  UAV  is  in  communication 
range  of  its  neighbors  (e.g.  complete  video  footage). 

Team  Logistics.  UAVs  have  limited  flight  time  and  must  be  periodically  refueled.  In  many  cases,  a 
UAV  may  be  re-tasked  to  investigate  a  perimeter  breach.  Hardware  failures  and  hazardous  flying  conditions 
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may  unexpectedly  remove  a  UAV  from  involvement.  A  perimeter  surveillance  solution  should  be  robust  to 
failures  and  allow  for  interruptions  such  as  reassignment  and  refueling, 

UAV  Capability,  The  maneuverability  of  the  UAV  agents  also  effects  the  monitoring  of  a  perimeter. 
We  assume  that  the  UAVs  are  equipped  with  an  autopilot  similar  to  the  one  described  in  [17]*  The  autopilot 
maintains  constant  altitude  and  each  UAV  on  the  team  is  given  a  unique  altitude  assignment.  The  autopilot 
has  been  tuned  so  that  the  closed-loop  system  exhibits  a  first-order  response  to  roll  and  airspeed  commands. 
The  equations  of  motion  for  a  single  UAV  can  be  written  as 


PN  —  V  cos  +  wN 

(i) 

PE  =*  V  sin  ip  +  We 

(2) 

$ -  ^7  tan  0 

(3} 

V=av(Ve~V) 

(4) 

II 

Q 

& 

1 

(5) 

where  p  —  ( Pn*Pe)t  is  the  inertial  position  of  the  UAV,  <j> ,  and  V  are  the  heading,  roll  angle,  and  airspeed 
of  the  UAV,  g  is  the  gravitational  constant,  w  =  (wn,  we)t  is  the  windspeed,  and  Vc  and  <fic  are  the  airspeed 
and  roll  angle  commands  given  to  the  autopilot.  The  first  order  response  of  the  autopilot  to  airspeed  and  roll 
angle  commands  are  quantified  by  the  parameters  ay  and  a# .  In  addition  to  these  kinematics,  a  constraint 
on  roll  angle  -<£max  <f>  max  is  enforced  to  ensure  the  safety  of  the  UAV.  The  presence  of  wind  and  the 

roll  angle  constraint  impair  the  maneuverability  of  the  team  agents. 

Developing  a  perimeter  surveillance  algorithm  that  accounts  for  these  complications  and  efficiently  gathers 
data  about  the  perimeter  state  is  not  trivial  We  reduce  the  general  problem  to  a  more  manageable,  but  still 
applicable,  one  and  present  the  team  behavior  that  efficiently  solves  that  problem  in  Section  L2.  Section  1*3 
then  introduces  and  proves  an  algorithm  for  reaching  the  desired  behavior  while  accounting  for  most  of  the 
complications, 

1.2  Linear  Perimeter  Surveillance 

We  reduce  the  general  perimeter  surveillance  problem  of  Section  1.1  to  the  linear  surveillance  problem  by 
requiring  that  the  perimeter  to  be  monitored  be  represented  as  a  single  path  between  two  points,  i.e.  the 
perimeter  is  homeomorphic  to  a  line.  This  removes  perimeters  that  are  circular  or  are  connected  in  a  web- 
type  format.  However,  an  arbitrary  perimeter  can  be  posed  as  a  linear  perimeter  by  constructing  a  single 
tour  that  traverses  all  segments  in  the  original  perimeter.  In  practice,  a  surveillance  mission  will  have  a  base 
of  operations  where  information  about  the  perimeter  is  analyzed  and  team  agents  are  refueled  and  launched. 
Circular  perimeters  can  be  treated  as  linear  perimeters  with  both  endpoints  at  the  base  of  operations. 

A  linear  perimeter  imposes  a  natural  order  to  the  team  where  each  agent  has  at  most  two  immediate 
neighbors  along  the  perimeter.  By  requiring  that  neighbors  physically  meet  to  transmit  information,  any 
size  of  communication  range  is  allowed.  In  practice,  the  sensor  limitations  of  an  agent  will  require  it  to 
monitor  the  entire  perimeter  up  to  its  neighbor  regardless  of  whether  the  agents  can  communicate  earlier, 
therefore  we  assume  that  agents  must  meet  to  transmit  data.  Agent  meetings  can  be  extended  to  allow 
the  transmission  of  large  amounts  of  data.  Loss  or  reassignment  of  team  agents  are  quickly  noticed  by  the 
change  in  the  neighborhood  of  affected  agents. 

Team  planning  is  done  by  considering  agents  as  point  agents  that  move  at  uniform  constant  velocity  along 
the  perimeter  (see  Figure  1).  Corresponding  UAV  agents  follow  their  reference  points  along  the  perimeter 
as  described  in  Section  L5,  Point  agents  can  reverse  direction  instantaneously  and  al'ways  do  so  when  the 
end  of  the  perimeter  is  encountered.  Communication  between  agents  is  only  allowed  when  the  agents  are 
“touching”,  i.e,  when  they  occupy  the  same  physical  location.  One  way  to  visualize  the  problem  is  to  imagine 
beads  sliding  along  a  string* 

The  performance  of  a  particular  monitoring  algorithm  can  be  measured  by  the  latency  associated  with 
information  about  points  along  the  perimeter.  Let  P  be  the  length  of  the  perimeter  and  let  the  perimeter 
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Figure  1 :  Example  scenario  where  8  agents  monitor  a  linear  perimeter. 


be  defined  as  a  line  along  the  x-axis  beginning  at  x  —  0  and  continuing  until  x  —  P,  Since  we  assume  that 
the  agents  travel  at  uniform  velocity,  V ,  and  data  transmission  only  occurs  when  the  agent  is  in  immediate 
physical  proximity,  the  soonest  information  about  point  Xq  is  available  to  a  recipient  at  the  base  of  operations 
(x  =  0)  is  in  xq/V  seconds.  The  minimum  latency  profile  is  obtained  when  an  agent  starts  at  the  far  end  of 
the  perimeter  and  travels  to  the  base  of  operations  at  which  time  it  transmits  all  the  perimeter  information. 

Note  that  adding  more  agents  cannot  decrease  the  latency  of  the  gathered  information  as  seen  at  the  base 
of  operations  since  information  can  only  travel  as  fast  as  a  single  agent.  However,  increasing  the  number  of 
agents  on  the  team  increases  the  refresh  mte  of  the  perimeter  state.  Intuitively,  spacing  agents  equally  so  that 
the  refresh  rate  is  constant  will  yield  the  most  efficient  method  of  perimeter  monitoring.  This  configuration 
can  be  achieved  by  tasking  each  agent  to  travel  to  the  end  of  the  perimeter  and  then  monitor  the  entire 
perimeter  as  it  returns  to  the  base  while  launching  agents  at  2P/N  intervals  where  JV  is  the  number  of  agents 
on  the  team.  As  agents  monitor  the  perimeter  while  traveling  to  the  base  of  operations  they  pass  agents 
traveling  to  the  end  of  the  perimeter  to  begin  monitoring.  These  meetings  occur  at  equally  spaced  intervals 
of  size  PfN,  Rather  than  have  agents  traverse  the  entire  perimeter  equally  spaced,  each  can  be  responsible 
for  a  segment  of  length  PfN  and  pass  its  gathered  information  to  its  neighbors  achieving  the  same  overall 
latency  profile  and  refresh  rate. 

Consider  the  behavior  of  a  team  of  four  agents  as  shown  in  Figure  2.  The  agents  are  uniformly  distributed 
along  the  perimeter  (Figure  2(a))  and  each  agent  meets  its  neighbors  at  the  end  of  its  segments  (Figures  2(b) 
and  2(c)).  This  oscillatory  behavior  of  the  agents  requires  that  the  team  be  synchronized  not  only  in  space 
(equally  distributed),  but  also  in  time  (meet  neighbors  at  the  end  of  segments). 
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(a)  Agents  are  uniformly  spaced  along 
the  perimeter. 

(bj  Neighbors  meet  and  exchange 
perimeter  state  information. 

(c)  Perimeter  state  is  carried  to  all 
points  along  the  perimeter. 

Figure  2:  Information  exchange  pattern  that  allows  information  about  the  state  of  the  perimeter  to  be 
available  at  any  point  along  the  perimeter. 

By  examining  the  behavior  illustrated  in  Figure  2  it  can  be  seen  that  information  gathered  at  neighbor 
meeting  locations  travels  to  all  other  locations  along  the  perimeter  in  the  shortest  time  possible.  This  can 
be  seen  by  noting  that  after  two  agents  meet  and  gather  information  about  the  perimeter  at  their  meeting 
place,  each  will  take  this  information  at  speed  V  to  any  other  place  along  their  respective  segments.  This 
information  is  passed  to  their  respective  neighbors  who  carry  it  further  along  the  perimeter,  again  at  speed 
V ,  Therefore,  the  information  gathered  at  the  neighbor  meeting  locations  is  carried  to  all  other  points  along 
the  perimeter  at  the  highest  possible  speed. 

Definition  1*1  (Low-Latency  Exchange  Configuration),  Consider  a  team  of  N  agents  monitoring  a  linear 
perimeter  of  length  P  defined  as  a  line  along  the  x-axis  from  x  =  0  to  x  —  P.  Order  the  agents  from  the  left 
end  of  the  perimeter  1 ...  N .  Consider  two  sets  of  team  agent  locations  on  the  perimeter : 

1.  Agent  n  £  l . . .  N  is  located  at  [n  +  ^( — 1)”] 

2.  Agent  n  €  l ...  N  is  located  at  \ n  +  j(— l)n]P/JV 


4 


BYU 

13  Decentralized  Solution  Magicc  Lab 


The  low- latency  exchange  configuration  is  the  behavior  realized  by  the  team  when  oscillating  between  these 
two  team  locations  at  speed  V . 

As  indicated  earlier1  the  low-latency  exchange  configuration  is  the  ideal  behavior  for  a  team  of  agents 
monitoring  a  linear  perimeter  and  it  will  be  the  desired  steady-state  behavior  of  the  decentralized  algorithm 
presented  in  Section  1.3.  In  addition  to  converging  to  the  low-latency  exchange  configuration,  the  algorithm 
will  address  deletion  and  insertion  of  team  members  and  variable  length  perimeters. 

1-3  Decentralized  Solution 

This  section  derives  a  decentralized  algorithm  to  reach  the  low- latency  exchange  configuration  defined  in 
Definition  1.1.  One  way  to  approach  such  a  problem  is  to  determine  the  coordination  variables  [18]  or 
minimum  amount  of  information  necessary  to  achieve  cooperation*  Three  such  critical  pieces  of  information 
for  this  problem  are:  (1)  the  perimeter  length,  (2)  the  number  of  agents  on  the  left  side  of  the  perimeter 
relati%Fe  to  a  given  agent,  and  (3)  the  number  of  agents  on  the  right  side  of  the  perimeter  relative  to  a  given 
agent.  When  each  agent  has  coordination  variables  that  match  reality,  then  each  will  be  able  to  compute 
the  perimeter  segment  for  which  it  is  responsible.  The  first  step  in  the  decentralized  solution  is  to  ensure 
that  when  each  agent  has  the  proper  values,  that  coordination  will  be  achieved. 

Consider  an  algorithm  where  each  agent  assumes  responsibility  for  a  portion  of  the  perimeter  and  escorts 
any  of  its  intruding  neighbors  to  their  shared  segment  border*  The  following  algorithm  ensures  that  if 
each  agent  has  coordination  variable  values  that  match  reality  (he*  each  agents  knows  the  length  of  the 
perimeter,  the  total  number  of  agents  on  the  team,  and  its  position  in  the  team),  then  the  low- latency 
exchange  configuration  will  reached* 

Algorithm  1:  Neighbor  Escort _ 

if  rendezvous  with  neighbor  then 
Calculate  shared  border  position* 

TYavel  with  neighbor  to  shared  border. 

Set  direction  to  monitor  own  segment, 
else  if  reached  perimeter  endpoint  then 
Reverse  direction, 
else 

Continue  in  current  direction. 


For  every  consecutive  pair  of  agents,  there  is  a  single  position  where  their  segments  border  each  other. 
When  each  agent  has  a  knowledge  of  the  length  of  the  perimeter  and  its  order  in  the  team,  then  the  endpoints 
of  its  responsible  segment  are  easily  computed*  If  an  agent  has  determined  that  its  position  in  the  team  is  n 
and  the  length  of  the  perimeter  is  Pt  then  the  segment  for  which  that  agent  is  responsible  has  endpoints  at 
(rc  —  1)P/N  and  nP/N.  The  endpoint  shared  with  a  neighbor  is  the  shared  border  position  to  which  both 
will  travel  together  in  the  first  phase  of  Algorithm  1.  In  other  words,  each  agent  escorts  its  neighbors  to  the 
position  at  which  they  should  have  met  had  they  been  in  perfect  synchronization.  Since  agents  only  reverse 
direction  at  perimeter  ends  and  when  they  finish  escorting  neighbor  agents,  each  agent  is  guaranteed  to  meet 
its  neighbors. 

Theorem  1*1.  Let  the  perimeter  length  and  number  of  agents  be  fixed ,  //  all  agents  have  coojdination 
values  that  match  realityf  then  Algorithm  1  ensures  that  the  low-latency  exchange  configuration  is  achieved 
after  time  2 T  has  passed  where  T  corresponds  to  the  time  required  for  one  agent  to  travel  the  length  of  the 
perimeter 

Proof:  Team  agents  can  initially  be  positioned  anywhere  along  the  perimeter  and  be  traveling  either  in  the 
positive  or  negative  direction  (recall  that  constant  uniform  velocity  is  assumed).  Since  each  agent  has  correct 
coordination  variables,  then  each  can  calculate  the  segment  along  the  perimeter  for  which  it  is  responsible. 
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Agents  are  guaranteed  to  meet  both  neighbors  since  Algorithm  1  only  commands  agents  to  reverse  direction 
at  a  perimeter  endpoint  or  when  concluding  a  neighbor  escort. 

For  N  agents  monitoring  a  border  of  length  Z5,  order  segments  of  size  P/N  from  the  left  edge  of  the 
perimeter  1  and  let  the  number  of  each  agent  correspond  to  the  segment  for  which  it  is  responsible. 

Consider  the  effect  of  Algorithm  1  on  the  leftmost  agent  (agent  1).  Once  agent  1  lias  escorted  its  right 
neighbor  to  their  shared  border,  then  no  agent  to  the  right  of  agent  1  will  ever  travel  along  segment  1  again. 
This  can  be  seen  by  noting  that  after  agents  1  and  2  split  at  their  shared  boundary  both  will  travel  the 
length  of  one  segment  to  get  to  the  opposite  end  of  their  respective  segments.  If  agent  2  meets  agent  3 
along  the  way,  then  agents  2  and  3  will  continue  to  their  shared  border  before  agent  2  reverses  direction,  as 
in  Figure  1.3.  Therefore,  agent  2  will  travel  at  least  one  segment  length  away  from  the  boundary  between 
segments  1  and  2,  Since  both  travel  at  a  uniform  constant  velocity,  then  agent  1  will  arrive  back  at  the 
border  between  segments  1  and  2  at  the  same  time  or  before  agent  2,  but  never  after.  Now  consider  agent 
2  after  it  has  been  escorted  by  agent  1  to  their  shared  boundary.  Since  by  this  time  agent  2  never  ventures 
into  segment  1 ,  the  border  between  agents  l  and  2  can  be  regarded  as  a  fixed  perimeter  endpoint  for  agent  2. 
The  same  analysis  now  holds  if  we  consider  agent  2  the  leftmost  agent  in  a  set  of  N  —  I  agents.  Observe  that 
the  same  argument  holds  starting  with  the  rightmost  agent  and  considering  all  agents  to  the  left.  Therefore, 
there  is  a  time  r  after  which  all  agents  are  only  found  on  their  respective  segments.  This  implies  that  the 
low-latency  exchange  configuration  of  Section  1.2  lias  been  reached, 

2  2 

1  1  3 

(a)  Agents  1  and  2  separate  at  their  shared  border.  (b)  Agent  2  encounters  agent  3  earlier  than  expected  and 

escorts  it  to  their  shared  border  before  reversing  direction. 


1 

(c)  Agents  1  and  2  separate  at  their  shared  border. 

The  worst  case  situation  occurs  when  all  agents  are  stacked  infinitesimally  close  at  one  end  of  the  perimeter 
and  are  traveling  toward  the  other.  Once  T  has  passed  all  agents  are  at  the  opposite  end  of  the  perimeter 
where  they  meet  both  neighbors.  Each  pair  will  travel  to  their  shared  borders  which  for  the  farthest  pair 
will  require  a  travel  time  less  than  T.  Therefore,  the  steady-state  behavior  will  be  achieved  before  time  2T, 

■ 

Figure  3  shows  two  simple  scenarios  with  8  agents  spreading  out  over  a  fixed  perimeter  where  each  agent 
begins  with  correct  coordination  variables.  The  positions  of  agents  along  the  perimeter  is  indicated  vertically 
with  the  time  axis  shown  horizontally  The  lattice  structure  indicates  that  the  desired  steady-state  behavior 
has  been  reached  since  agents  turn  around  at  precisely  their  desired  neighbor  rendezvous  locations.  Note 
that  the  agents  require  very  few  meetings  with  other  agents  to  converge  to  the  proper  configuration. 

1-3-1  Comparison  with  Centralized  Algorithm 

To  understand  the  value  of  Algorithm  1,  it  is  useful  to  compare  its  performance  with  other  methods  of  perime¬ 
ter  surveillance,  A  centralized  method  for  reaching  the  low-latency  exchange  configuration  is  to  compare  the 
initial  positions  of  the  team  with  all  possible  team  locations  in  the  low-latency  exchange  configuration  and 
find  the  one  that  requires  the  shortest  convergence  time. 


2 

T  3 

(d)  Agent  2  encounters  agent  3  later  than  expected  and 
escorts  it  to  their  shared  border  before  continuing  on  to 
meet  agent  1. 
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(a)  Initial  positions  and  directions  for  a  group  of  8  agents  in 
scenario  A. 


(c)  Position  of  agents  along  the  perimeter  over  time  for  sce¬ 
nario  A, 


2  4  6  8 

T  3  5  7 

(b)  Initial  positions  and  directions  for  a  group  of  8  agents  in 
scenario  B. 


(d)  Position  of  agents  along  the  perimeter  over  time  for  sce¬ 
nario  B, 


Figure  3:  Team  behavior  in  two  scenarios  for  point  agents  whose  behavior  is  governed  by  Algorithm  L  The 
position  of  agents  along  the  perimeter  is  indicated  vertically  with  the  time  axis  shown  horizontally.  The 
lattice  structure  indicates  that  the  desired  steady-state  behavior  has  been  reached. 
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Let  S  be  the  set  of  team  positions  during  the  desired  steady-state  operation  where  an  element  s  €  $ 
consists  of  N  posit  ions,  s,  corresponding  to  the  position  of  agent  i  in  the  low-latency  exchange  configuration. 
Note  that  the  set  of  all  team  configurations  that  satisfy  the  low- latency  exchange  configuration  can  be 
parameterized  by  the  position  of  the  first  agent 


t  -  1  f  (^  —  -si)  if  i  is  even 

N  \  Si  otherwise  . 


(«) 


Therefore,  if  the  position  of  the  first  agent  is  known  in  the  low  latency  configuration,  then  for  a  perimeter 
of  unit  length,  s  i  is  on  the  interval  [0,  and  all  other  positions  can  be  calculated  using  Equation  (6).  The 
centralized  method  is  to  command  the  team  located  at  i  =  1  . . .  N  to  converge  to  s *  where 


s*^argmin  max  |pi  —  s,|  ,  (7) 

i— 1...N 

In  other  words,  the  optimal  solution  is  to  pick  the  closest  team  position  in  the  low-latency  configuration 
to  the  current  position  of  the  agent.  During  the  transition  From  the  initial  position  to  the  nearest  low- 
latency  configuration  position  s*,  agents  reach  their  correct  position  and  wait  there  until  the  remaining  team 
members  have  reached  their  respective  positions, 

in  the  worst  case  scenario  with  all  agents  located  at  one  end  of  the  perimeter,  the  centralized  algorithm 
converges  in  T,  twice  a s  fast  as  the  decentralized  method.  Figure  4  shows  a  comparison  of  the  centralized 
algorithm  and  Algorithm  1.  Note  that  the  centralized  algorithm  requires  agents  to  wait  or  loiter  at  the 
proper  location  until  all  agents  have  reached  s *.  This  is  indicated  by  the  straight  lines  in  Figure  4, 

Monte-Carlo  simulations  indicate  that  the  centralized  algorithm  readies  the  low-latency  exchange  config¬ 
uration  on  average  G.67T  faster  than  the  decentralized  method  (standard  deviation  of  0.17T1).  The  maximum 
time  difference  between  the  centralized  algorithm  and  Algorithm  1  was  0.998T  with  theoretical  worst  case 
difference  of  T ,  The  centralized  algorithm  requires  complete  knowledge  of  the  state  of  the  team  and  explicit 
cooperation  of  all  team  members.  The  value  of  Algorithm  1  is  that  its  performance  is  comparable  to  the 
optimal  solution  in  speed,  but  is  implemented  in  a  decentralized,  robust  way. 


1,3.2  Comparison  with  Consensus  Method 

The  second  method  to  which  we  compare  Algorithm  1  is  a  distributed  consensus  algorithm  modified  for 
perimeter  surveillance.  The  standard  consensus  problem  is  when  a  group  of  agents,  each  with  a  variable  of 
arbitrary  initial  value,  reaches  agreement  on  the  value  of  that  variable  through  repeated  interaction  with 
other  agents  on  the  team.  For  example,  one  method  of  coming  into  consensus  is  for  each  agent  to  repeatedly 
average  its  associated  variable  with  those  communicated  from  its  immediate  neighbors.  If  the  interaction 
graph  among  the  team  contains  a  spanning  tree,  then  the  variable  of  each  agent  will  asymptotically  approach 
a  constant  shared  value  and  the  team  is  said  to  asymptotically  reach  consensus  [19] . 

Adapting  a  consensus  method  to  the  perimeter  surveillance  problem  involves  defining  the  value  associated 
for  each  agent  and  a  strategy  for  updating  those  values.  Let  the  length  of  the  segment  for  which  an  agent  is 
responsible  be  the  value  associated  with  that  agent.  Consider  the  rendezvous  of  two  agents  on  the  perimeter. 
When  agents  meet,  they  communicate  the  length  of  their  respective  segments  and  average  to  find  the  midpoint 
of  their  shared  segment.  Both  travel  together  to  the  midpoint  of  their  shared  segment  [20]  and  split  apart 
with  an  updated  value  for  how  much  of  the  perimeter  each  is  responsible  for.  For  every  pair  of  agents,  their 
shared  segment  is  defined  by  endpoints  determined  by  the  locations  each  agent  met  its  previous  neighbor. 

The  difficulty  with  this  method  lies  in  the  realization  that  the  value  to  which  the  team  will  converge  must 
be  P/N  where  P  is  the  length  of  the  perimeter;  otherwise,  agents  would  be  continuously  overlapping  or 
neglecting  part  of  the  perimeter.  A  specialization  of  the  general  consensus  problem  to  the  average  consensus 
problem  can  be  made  which  ensures  that  the  team  will  converge  to  the  exact  average  of  the  initial  values. 
The  only  remaining  difficulty  is  initializing  the  system  so  that  the  segment  lengths  associated  with  the  team 
of  agents  sum  to  P .  We  do  this  by  assuming  that  agents  are  launched  with  a  value  of  zero  with  the  exception 
of  the  first  agent  who  travels  to  the  end  of  the  perimeter  and  initializes  its  value  to  P.  This  had  a  number  of 
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(a)  Initial  positions  and  directions  for  a  group  of  8  agents. 


(b)  Initial  positions  and  directions  for  a  group  of  S  agents. 


(c)  Position  of  agents  along  the  perimeter  over  time  when  using 
the  centralized  algorithm  (7). 


(d)  Position  of  agents  along  the  perimeter  over  time  when  us¬ 
ing  Algorithm  L 


Figure  4:  Team  behavior  in  a  comparison  of  Algorithm  1  and  the  centralized  algorithm  (7).  The  position  of 
agents  along  the  perimeter  is  indicated  vertically  with  the  time  axis  shown  horizontally.  The  lattice  structure 
indicates  that  the  desired  steady-state  behavior  has  been  reached.  Straight  lines  indicate  that  an  agent  is 
maintaining  its  current  position  along  the  perimeter.  Note  that  the  centralized  algorithm  requires  agents  to 
wait  for  the  rest  of  the  team  to  settle  into  the  optimal  starting  configuration  while  Algorithm  1  reaches  the 
low- latency  exchange  configuration  after  some  interaction  time. 
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consequences:  although  the  algorithm  can  account  for  arbitrary  perimeter  length,  the  perimeter  must  remain 
fixed;  secondly,  loss  of  an  agent  during  the  mission  will  remove  its  segment  length  from  the  knowledge  of 
the  team.  In  each  case,  the  prerequisites  for  average  consensus  would  be  violated  and  the  team  would  fail 
to  converge.  Additionally,  convergence  is,  in  general,  asymptotic  in  nature  rather  than  in  finite-time  as 
Algorithm  1  guarantees.  Figure  5  shows  the  performance  of  the  average  consensus  algorithm  compared  to 
Algorithm  1 .  In  addition  to  the  above  limitations  of  fixed  perimeter  length  and  asymptotic  convergence,  the 
consensus  method  seems  to  exhibit  poor  transient  response. 
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(a)  Initial  positions  and  directions  for  a  group  of  8  agents. 


(b)  Initial  positions  and  directions  for  a  group  of  8  agents. 


(c)  Position  of  agents  along  the  perimeter  over  time  when  using 
the  centralized  algorithm  (7). 


(d)  Position  of  agents  along  the  perimeter  over  time  when  us^ 
ing  Algorithm  1. 


Figure  5:  Team  behavior  in  a  comparison  of  Algorithm  1  and  a  consensus  method  (7).  The  position  of 
agents  along  the  perimeter  is  indicated  vertically  with  the  time  axis  shown  horizontally.  The  lattice  structure 
indicates  that  the  desired  steady-state  behavior  has  been  reached.  Note  that  the  consensus  method  converges 
asymptotically  while  Algorithm  1  reaches  the  low-latency  exchange  configuration  in  finite  time. 

Algorithm  I  relies  only  on  interactions  of  an  agent  with  its  immediate  neighbors  on  the  perimeter  yet 
converges  to  the  low- latency  exchange  configuration  in  finite  time.  In  Section  L4  we  show  that  decentralized 
nature  of  the  algorithm  allows  the  team  to  accommodate  loss  or  reassignment  of  agents.  In  the  event  of  a 
perimeter  breach,  an  agent  can  be  assigned  to  follow  the  intruder  while  the  rest  of  the  team  reconfigures  to 
monitor  the  border  in  its  absence.  Since  the  algorithm  converges  in  finite-time,  the  loss  in  perimeter  coverage 
is  quickly  compensated  for.  This  same  natural  reconfiguration  behavior  is  desirable  in  the  event  of  refueling 
and  agent  loss  due  to  hazardous  conditions. 


1.4  Changing  Perimeters 

Theorem  LI  ensures  finite  time  convergence  to  the  low- latency  exchange  configuration  of  Section  1.2  when  the 
coordination  variables  match  reality.  By  adding  the  ability  to  update  the  coordination  variables,  Algorithm  1 
can  be  modified  to  ensure  that  there  will  be  a  time  when  the  coordination  variables  are  correct.  This  will 
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allow  the  team  to  naturally  compensate  for  agent  reassignment  or  loss  and  perimeter  growth. 

Each  agent  maintains  local  variables  that  track  the  amount  of  perimeter  distance  and  number  of  agents 
on  each  side.  These  coordination  variables  are  updated  when  meeting  with  another  agent  on  the  team  - 
both  agents  have  knowledge  about  the  number  of  agents  and  perimeter  length  on  their  respective  sides.  If 
the  perimeter  and  number  of  agents  is  fixed,  then  the  coordination  variables  will  eventually  be  consistent 
among  the  team  since  agents  are  guaranteed  to  meet  both  neighbors.  Once  the  coordination  variables  are 
correct,  Theorem  1.1  ensures  that  the  desired  steady-state  behavior  will  be  achieved.  Note  that  the  same 
method  used  to  update  the  coordination  variables  can  also  be  used  to  detect  changes  in  the  perimeter  or 
insert  ion /deletion  of  team  members.  Algorithm  2  includes  the  local  coordination  variables  and  the  logic 
required  to  update  each  agent.  In  Algorithm  2  the  variable  ‘side’  is  used  to  indicate  the  positioning  of  an 
agent  relative  to  its  neighbor,  he.  left1  or  'right1  along  the  perimeter.  The  term  'side1  is  simply  the  opposite 
of  'side1. 

Algorithm  2:  Variable  Neighbor  Escort 
Variables: 

/left  length  of  perimeter  to  left  of  agent 
P right  length  of  perimeter  to  right  of  agent 
iV|cft  number  of  agents  to  left  of  agent 
AVight  number  of  agents  to  right  of  agent 
if  rendezvous  with  neighbor  then 
Communicate  /^de  and  Na\^e. 

Update  and  N^, 

Calculate  shared  border  position. 

Travel  with  neighbor  to  shared  border. 

Set  direction  to  monitor  own  segment, 
else  if  reached  perimeter  endpoint  then 
Reverse  direction. 

Fside  =  0 

^aide  =  0 

else 

Continue  in  current  direction  while  updating  Fajde« 


Algorithm  2  operates  in  the  same  manner  as  Algorithm  I  only  with  the  additional  steps  of  communicating 
and  updating  the  coordination  variables.  For  example,  consider  two  agents  starting  from  opposite  ends  of 
the  perimeter,  each  without  knowledge  of  the  other.  Let  agent  1  start  at  x  =  0  and  agent  2  start  at  x  =  P, 
but  let  the  launch  time  of  agent  2  be  delayed  with  respect  the  launch  of  agent  1.  As  each  agent  progresses 
along  the  perimeter,  it  keeps  track  of  the  distance  traveled  from  launch.  When  the  two  agents  finally  meet, 
agent  1  updates  Afright  to  be  equal  to  one  plus  the  number  of  agents  to  the  right  of  agent  2  and  Flight  equal 
to  flight  communicated  from  agent  2;  similarly,  agent  2  updates  7V|eft  and  F\cit  from  the  communication 
from  agent  1.  At  this  point,  the  coordination  variables  match  reality  and  Theorem  1.1  ensures  that  the 
low- latency  exchange  configuration  will  be  reached  in  finite  time. 

The  applicability  of  Algorithm  2  lies  in  the  fact  that  each  agent  has  finite  memory.  Since  the  coordination 
variables  are  updated  with  the  most  recent  information  gathered,  past  information  does  not  affect  team 
behavior.  This  finite  memory  property  allows  the  team  to  adapt  to  changes  in  perimeter  and  team  size. 
Figure  6  shows  agent  tracking  on  a  perimeter  with  a  step  change  in  size  and  a  perimeter  with  sinusoidal 
growth.  The  algorithm  accommodates  step  changes  in  perimeter  size,  but  also  allows  good  tracking  for  other 
types  of  perimeter  growth.  Note  that  agents  do  not  have  any  knowledge  a  priori  of  the  perimeter  length  or 
number  of  agents  on  the  team.  The  coordination  variables  are  updated  through  repeated  interactions  with 
other  team  members. 

Algorithm  1  can  also  be  extended  to  account  for  long  communication  events.  In  a  perimeter  imaging 
scenario,  agents  survey  the  perimeter  segment  for  which  they  are  responsible  and  when  each  meets  its 
neighbor  it  must  transmit  large  amounts  of  data.  In  this  case,  agent  meetings  cannot  be  instantaneous,  rather 
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(a)  Positions  of  agents  over  time  for  step  change  in  perimeter 
Length. 


{b)  Positions  of  agents  over  time  for  sinusoidal  perimeter 
growth. 


Figure  6:  Team  behavior  of  agents  tracking  changing  perimeters  using  Algorithm  2  to  continuously  update 
the  coordination  variables.  Agents  learn  the  size  of  the  perimeter  and  number  on  the  team  through  repeated 
interaction  with  other  team  members. 


a  fixed  amount  of  time  is  allotted  for  agents  to  loiter  at  the  meeting  location  to  allow  longer  communication 
events.  After  an  agent  finishes  escorting  its  neighbor,  both  loiter  together  for  a  pre-determ ined  amount  of 
time.  Figure  7  shows  a  scenario  involving  long  communication  events. 


Figure  7:  Example  scenario  where  Algorithm  1  is  modified  to  account  for  long  rendezvous  timing. 


1.5  UAV  Agents 

Algorithm  1  developed  the  motion  of  the  reference  points  for  a  team  of  UAYs  to  follow  to  achieve  the  low- 
latency  exchange  configuration.  In  practice,  the  reference  point  generalization  is  only  followed  when  agents 


12 


BYU 

1.6  Simulation  Results  Mag  ICC  Lab 


are  involved  in  a  rendezvous  with  another  agent.  Between  meetings,  the  center  of  the  constant  airspeed 
UAV  is  considered  the  point  along  the  perimeter.  However,  since  a  UAV  has  a  constrained  turning  radius,  it 
cannot  precisely  follow  a  reference  point  that  can  instantaneously  turn  around.  The  purpose  of  this  section 
is  to  investigate  the  limitations  of  Algorithm  1  when  applied  to  perimeter  surveillance  using  a  team  of  UAVs. 

In  Section  1.2  agents  are  modeled  as  points  that  could  communicate  only  when  touching.  Now  consider 
UAVs  flying  at  constant  velocity  with  nominal  turning  radius  R .  To  complete  a  U-turn  with  such  a  turning 
radius  requires  that  the  UAV  be  aware  of  the  need  to  turn  around  a  distance  of  before  the  actual 
rendezvous  [21),  Since  both  UAVs  must  have  a  buffer  of  this  distance  in  order  to  complete  a  turn-around, 
the  communication  radius  of  each  UAV  must  be  greater  than  R.  When  a  UAV  becomes  aware  of  an 
immanent  rendezvous ,  it  can  calculate  where  it  should  begin  its  U-turn  maneuver  to  maintain  the  point- 
agent  motion  of  Algorithm  L 

The  constrained  turning  radius  also  affects  how  many  agents  can  monitor  a  perimeter  without  being 
interrupted  during  a  turn.  The  smallest  segment  is  one  on  which  a  UAV  can  complete  two  U- turns  con¬ 
secutively,  so  a  perimeter  of  length  P  being  monitored  by  UAVs  with  radius  R  can  have  at  most  [ 
agents. 

Consider  a  scenario  with  N  UAV  agents  that  have  a  large  enough  communication  radius  to  perform  the 
U-turn  maneuver  as  described  above.  Also,  let  the  perimeter  be  large  enough  so  that  in  steady-state  the 
segment  for  which  a  UAV  is  responsible  is  large  compared  to  the  turning  radius  of  the  vehicle  (specifically, 
larger  than  Under  what  conditions  can  we  ensure  that  during  the  transient  period,  no  UAV  is 

interrupted  while  performing  a  turning  maneuver?  In  other  words,  when  is  the  distance  between  consecutive 
direction  reversals  smaller  than  the  distance  required  to  perform  those  maneuvers? 

It  is  easy  to  show  that  when  the  scenario  allows  for  uninterrupted  steady-state  behavior  and  the  agents 
have  consistent  coordination  variables,  that  no  UAV  is  interrupted  during  the  transient  period.  However, 
during  the  time  when  the  team  is  forming  a  consistent  set  of  coordination  variables,  the  distance  between 
consecutive  direction  changes  can  be  made  arbitrarily  short  with  proper  choice  of  initial  positions  and 
directions  of  the  agents.  A  sufficient  condition  to  ensure  that  no  UAV  is  interrupted  in  the  transient  period 
is  to  enforce  a  separation  distance  of  at  least  at  launch.  In  the  general  case,  Algorithm  1  could  be 
further  modified  to  allow  interrupted  turns  to  translate  into  effective  perimeter  growth.  These  effects  will 
die  out  as  soon  as  the  coordination  variables  reach  consistency. 


1.6  Simulation  Results 

To  verify  the  feasibility  of  implementing  Algorithm  1  on  a  team  of  UAVs,  a  high  fidelity  simulation  is  per¬ 
formed.  Each  UAV  is  simulated  with  full  6  degree  of  freedom  dynamics  model  with  aerodynamic  parameters 
that  match  the  small  UAVs  flown  at  BYU  [17].  The  simulation  scenario  involved  three  UAVs  monitoring  a 
changing  perimeter  composed  of  4  waypoints  with  a  total  length  of  1,46  km.  Each  UAV  is  equipped  with 
autopilot  software  that  enables  accurate  waypoint  tracking  [17]  with  a  turning  radius  of  approximately  50 
meters.  The  communication  model  allows  UAVs  to  communicate  only  to  adjacent  neighbors  who  are  inside 
the  communication  range  of  approximately  370  meters,  the  minimum  distance  necessary. 

The  simulation  scenario  starts  with  only  two  of  the  three  UAVs  being  launched.  Each  agent  starts 
without  any  knowledge  of  the  number  of  agents  on  the  team  nor  the  perimeter  length.  Even  though  the 
perimeter  is  defined  by  predetermined  waypoints,  we  require  the  UAVs  to  initially  treat  the  perimeter  length 
as  unknown.  After  about  400  seconds,  a  step  change  in  the  perimeter  length  occurred  by  adding  an  additional 
waypoint,  followed  by  another  change  a  short  time  later.  At  approximately  900  seconds  in  simulation  time, 
the  third  UAV  was  launched.  Before  the  simulation  terminated,  the  team  experienced  two  more  changes  in 
the  perimeter  length,  one  at  each  end. 

Figure  8  shows  the  simulation  results  by  plotting  the  normalized  position  of  each  UAV  along  the  length  of 
the  perimeter.  Note  that  in  the  regions  where  the  team  should  already  be  locked  into  the  ideal  configuration, 
some  position  overlap  is  still  observed.  This  is  caused  by  the  inability  of  the  UAVs  to  perform  the  U-turn 
maneuver  precisely,  and  resulted  in  a  disturbance  to  the  system.  However,  the  overall  behavior  of  the 
team  was  as  expected,  with  the  agents  reaching  the  desired  steady-state  behavior  quickly  and  reacting 
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Figure  8:  Simulation  results  showing  the  normalized  position  of  each  UAS  along  the  perimeter.  Changes 
to  the  perimeter  length  occurred  at  approximately  400,  700,  1100,  and  1600  seconds.  The  third  agent  was 
introduced  at  approximately  950  seconds.  The  sharp  peaks  are  a  result  of  the  coordination  variables  being 
reset. 

appropriately  to  step  changes  in  both  the  perimeter  length  and  team  size. 

It  should  be  noted  that  even  though  the  UAVs  cannot  turn  around  instantaneously,  the  position  plot  in 
Figure  8  showrs  the  agents  making  sudden  changes  in  direction.  This  is  a  result  of  the  UAV  resetting  the 
coordination  variables  when  it  begins  to  execute  the  turn  around  command. 


1.7  Flight  Test  Results 

The  decentralized  cooperative-surveillance  algorithm  was  further  validated  by  hardware  flight  tests  using 
the  experimental  testbed  described  [17].  Figure  9  displays  the  normalized  position  of  two  UAVs  along  the 
perimeter  while  Figure  10  shows  the  inertial  position  plots  that  were  generated  from  the  actual  telemetry 
files  of  the  UAVs.  The  latter  figure  demonstrates  the  algorithm  by  showing  (a)  some  initial  condition  for  the 
two  agents,  (b)  the  first  rendezvous,  (c)  the  turn-around  at  the  shared  border,  (d)  the  first  meeting  of  the 
perimeter  endpoints,  (e)  the  second  rendezvous,  and  (f)  the  second  meeting  of  the  perimeter  endpoints. 

The  algorithm  was  initiated  at  approximately  50  seconds,  after  the  twfo  agents  had  passed  each  other. 
The  first  UAV  (blue)  turned  around  immediately  while  the  second  agent  (red)  traveled  to  the  shared  border 
before  turning  around.  At  this  point  the  agents  had  reached  the  steady-state  configuration.  Similar  to  the 
simulation  results,  there  was  some  overlap  in  position  between  the  two  agents.  This  is  a  result  of  the  inability 
of  the  UAVs  to  complete  a  precise  U-turn  maneuver.  It  should  also  be  noted  that  the  shared- border  position 
of  the  two  agents  appears  to  be  around  60%  of  the  perimeter  length  instead  of  the  theoretically  predicted 
50%.  This  deviation  was  caused  by  wind  pushing  the  second  agent,  thereby  enabling  Agent  2  to  cover  more 
distance  than  Agent  !.  Wind  speeds  during  the  flight  tests  were  estimated  at  35%  of  the  airspeed  of  the 
UAVs.  Despite  the  disturbance  of  the  wind,  the  agents  were  still  able  to  effectively  distribute  themselves 
evenly  along  the  perimeter. 
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Figure  9:  Experimental  results  showing  the  normalized  position  of  each  U AS  along  the  perimeter.  The 
decentralized  cooperative-surveillance  algorithm  was  started  at  approximately  50  seconds, 

1.8  Conclusions 

This  section  has  presented  a  decentralized  algorithm  for  perimeter  surveillance  that  converges  in  finite  time. 
By  sharing  information  regarding  the  perimeter  length  and  number  of  team  members,  each  agent  obtains 
a  consistent  set  of  coordination  variables  that  allows  the  decentralized  algorithm  to  operate.  Advantages 
include  the  ability  to  monitor  changing  perimeters,  account  for  dynamic  insertion  and  deletion  of  team 
members,  and  operate  with  small  communication  range  in  a  decentralized  manner.  Simulation  mid  flight 
tests  were  performed  to  validate  the  effectiveness  of  the  algorithm. 
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Figure  10:  Various  plots  generated  from  the  actual  telemetry  files  of  the  UASs  collected  during  the  exper¬ 
imental  flight  tests.  These  demonstrated  the  functionality  of  the  distributed  spread  algorithm,  where  (a) 
are  the  initial  conditions,  (b)  is  the  first  rendezvous,  (c)  is  the  turn-around  at  the  shared  border,  (d)  is  the 
first  meeting  of  the  perimeter  endpoints,  (e)  is  the  second  rendezvous,  and  (f)  is  the  second  meeting  of  the 
perimeter  endpoints. 
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2  Kalman  Consensus  Strategies  and  Their  Application  to  Coop¬ 
erative  Control 

During  the  last  two  decades  there  has  been  a  dramatic  paradigm  shift  in  the  way  that  computer  systems  are 
designed:  moving  from  centralized  mainframe  computers  to  networks  of  less  capable,  but  much  less  expensive, 
personal  computers.  In  much  the  same  way,  replacing  large,  expensive,  monolithic  vehicles  with  teams  of 
networked  vehicles,  promises  less  expensive,  more  capable  systems.  In  addition,  there  are  applications  where 
a  team  of  vehicles  can  accomplish  objectives  that  would  be  impossible  for  a  single  vehicle.  For  example,  a 
formation  of  networked  spacecraft  could  be  used  to  synthesize  a  space-based  interferometer  with  base-lines 
reaching  tens  to  hundreds  of  kilometers  [22,  23].  With  teams  of  vehicles,  much  of  the  design  complexity  is 
shifted  from  mechanical  hardware  design  to  software  that  regulates  the  interaction  of  the  team. 

In  recent  years,  there  has  been  significant  interest  and  research  activity  in  the  area  of  coordinated  and 
cooperative  control  [24,  25,  26,  27,  28,  29,  30,  31].  Much  of  this  work  assumes  the  availability  of  global  team 
knowledge,  and/or  the  ability  to  plan  group  actions  in  a  centralized  manner. 

Centralized  coordination  techniques  are  suitable  if  each  member  of  the  team  has  the  ability  to  communi¬ 
cate  to  a  centralized  location  or  if  the  team  is  able  to  share  information  via  a  static  fully  connected  network. 
On  the  other  hand,  real-world  communication  topologies  are  usually  not  fully  connected.  In  many  cases  they 
depend  on  the  relative  position  of  the  vehicles  and  on  other  environmental  factors  and  are  therefore  dynam¬ 
ically  changing  in  time.  In  addition,  wireless  communication  channels  are  subject  to  multi-path,  fading  and 
drop-out.  Therefore,  cooperative  control  in  the  presence  of  real-world  communication  constraints,  becomes 
a  significant  challenge. 

In  a  recent  article  we  argued  that  “shared  information  is  a  necessary  condition  for  cooperation*1  [16]. 
Shared  information  may  take  the  form  of  common  objectives,  common  control  algorithms,  relative  position 
information,  or  a  world  map.  If  this  assertion  is  true,  then  information  exchange  becomes  a  central  issue  in 
cooperative  control.  In  this  article,  we  will  refer  to  the  information  that  is  necessary  for  coordination  as  the 
coordination  information  or  coordination  variable  [32],  In  the  presence  of  an  unreliable,  dynamically  changing 
communication  topology,  it  is  not  possible  for  all  of  the  vehicles  to  have  access  to  identical  coordination 
information.  Suppose  that  a  particular  cooperation  strategy  has  been  devised  and  shown  to  work  if  the  team 
has  global  access  to  the  coordination  information.  Cooperation  will  occur  if  each  member  on  the  team  has 
access  to  the  same  information. 

As  an  example,  consider  the  meet- for-d inner  problem  introduced  in  [16].  In  this  problem,  a  group  of 
friends  decide  to  meet  for  dinner  at  a  particular  restaurant  but  fail  to  specify  a  precise  time  to  meet.  On  the 
afternoon  of  the  dinner  appointment,  each  individual  realizes  that  they  are  uncertain  about  the  time  that 
the  group  will  meet  for  dinner.  A  centralized  solution  to  this  problem  is  for  the  group  to  have  a  conference 
call,  to  poll  each  individual  regarding  their  preferred  time  for  dinner,  and  to  average  the  answers  to  arrive 
at  a  time  that  the  group  will  meet  for  dinner.  However,  this  centralized  solution  requires  that  a  conference 
line  is  available,  and  that  the  time  of  the  conference  call  is  known  to  the  group.  Since,  whatever  algorithm 
was  used  to  convey  the  time  of  the  conference  call  to  the  group,  could  also  have  been  used  to  convey  the 
time  to  meet  for  dinner,  the  central  problem  remains. 

The  information  variable  in  this  example  is  the  time  that  the  group  will  meet  for  dinner.  The  particular 
time  is  not  what  is  important,  but  rather  that  each  individual  in  the  group  has  a  consistent  understanding 
of  that  information.  A  decentralized  solution  to  the  problem  would  be  for  each  individual  to  call,  one  at  a 
time,  a  subset  of  the  group.  Given  his  current  estimate  of  the  meeting  time,  the  individual  might  update  his 
estimate  of  the  meeting  time  to  be  a  weighted  average  of  his  current  meeting  time  and  that  of  the  person 
with  whom  he  is  conversing.  The  question  (which  will  be  answered  in  this  section)  is  under  what  conditions 
this  strategy  will  enable  the  entire  team  to  converge  to  a  consistent  meeting  time. 

Therefore,  if  a  centralized  solution  to  a  cooperation  problem,  with  its  associated  coordination  information, 
has  been  devised,  then  two  additional  questions  must  be  addressed.  First,  what  algorithms  should  be 
employed  to  ensure  that  the  team  is  converging  to  a  consistent  view  of  the  coordination  information  in 
the  presence  of  an  unreliable,  dynamically  changing  communication  topology?  Second,  if  the  action  of  the 
group  is  based  on  the  (dynamically  changing)  coordination  variable,  will  the  cooperative  control  algorithm 
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be  robust  with  respect  to  the  transient  error  in  the  coordination  variable  across  the  team? 

Convergence  to  a  consistent  view  of  the  coordination  variable  in  the  presence  of  an  unreliable,  dynamically 
changing  communication  topology  is  called  the  consensus  problem.  Consensus  problems  have  recently  been 
addressed  in  [33,  34,  35,  36,  37,  16,  38,  39],  to  name  a  few.  In  [34],  sufficient  conditions  are  given  for 
consensus  of  the  heading  angles  of  a  group  of  agents  under  undirected  switching  interaction  topologies. 
In  [35],  average  consensus  problems  are  solved  for  a  network  of  integrators  using  directed  graphs.  In  [16] 
and  [38],  an  algebraic  graph  approach  is  used  to  show  necessary  and/or  sufficient  conditions  for  consensus 
of  information  under  time- invariant  and  switching  interaction  topologies  respectively.  In  [36],  a  set- valued 
Lyapunov  function  approach  is  used  to  consider  discrete-time  consensus  problems  with  unidirectional  time- 
dependent  communication  links. 

Previous  consensus  seeking  results  reported  in  the  literature  do  not  explicitly  account  for  agent  confidence 
in  their  instantiation  of  the  coordination  variable.  Most  results  assume  that  each  individual  in  the  group 
has  identical  confidence  in  their  instantiation  of  the  coordination  variable.  However,  there  are  many  cases 
where  some  individuals  on  the  team  will  have  access  to  better  information  than  others.  In  cases  like  these, 
the  consensus  algorithm  needs  to  be  biased  to  favor  agents  with  better  information.  For  example,  if  a  team 
of  UAVs  is  tasked  with  tracking  the  location  of  a  group  of  ground  vehicles,  the  quality  of  information  will 
be  proportional  to  the  relative  sensing  distance.  UAVs  that  have  recently  flown  close  to  a  ground  vehicle 
should  be  considered  more  reliable  than  those  that  are  sensing  from  a  greater  distance,  or  whose  information 
is  old.  As  another  example,  in  the  meet- for- dinner  problem  described  above,  if  one  individual  is  considered 
more  reliable  than  the  others,  his/her  information  should  be  weighted  more  heavily  when  making  the  team 
decision. 

The  primary  contribution  of  this  section  is  to  derive  continuous-time  and  discrete-time  consensus  strate¬ 
gies,  based  on  a  Kalman-filter  structure,  that  asymptotically  achieves  consensus  in  the  presence  of  an  un¬ 
reliable,  dynamically  changing  conimun  Scat  ion  topology,  giving  proper  weight  to  individuals  with  greater 
certainty  in  their  coordination  variable.  In  addition,  we  will  show  that  the  Kalman  consensus  scheme  is 
input-to-state  stable  (ISS)  where  the  input  is  the  communication  noise  on  each  channel  and  the  state  is  the 
consensus  error  between  each  pair  of  agents.  The  ISS  property  will  be  exploited  to  develop  a  distributed 
multi- vehicle  cooperative  control  solution  to  the  cooperative  timing  problem. 

UAV  cooperative  timing  problems  have  been  investigated  recently  in  the  context  of  battlefield  scenarios 
where  the  UAVs  are  required  to  converge  on  the  boundary  of  a  radar  detection  area  to  maximize  the  element 
of  surprise  [40,  32,  24,  41,  42].  Cooperative  timing  problems  also  arise  in  refueling  scenarios,  fire  and 
hazardous  material  monitoring,  moving  area  of  regard  problems,  and  continuous  surveillance  problems.  In 
this  section  we  will  investigate  a  simplified  cooperative  timing  problem  that  must  be  accomplished  in  the 
presence  of  an  unreliable,  dynamically  changing  communication  topology. 

The  section  is  organized  as  follows.  In  Section  2.1,  we  give  an  intuitive,  non-rigorous  derivation  of  the 
Kalman- like  consensus  strategies,  and  show  their  application  to  the  meet- for-d inner  problem.  Section  2.2 
contains  the  main  technical  results  that  shows  that  the  proposed  consensus  strategies  are  convergent  under 
certain  mild  conditions.  In  Section  2.3  we  show  that  the  Kalman  consensus  scheme  is  input-to-state  stable. 
As  a  corollary,  we  show  that  most  of  the  other  consensus  schemes  proposed  in  the  literature,  are  also  ISS.  In 
Section  2.4  the  ISS  property  is  exploited  to  develop  a  distributed  solution  to  the  cooperative  timing  problem. 
Finally,  Section  2.5  contains  our  conclusions. 

2*1  Kalman-filter  Approach  to  Multi-Agent  Consensus 

The  Kalman  filter  is  used  extensively  to  estimate  a  system's  current  state  from  imprecise  measurement 
data  [43,  44,  45].  It  is  well-known  that  the  Kalman  filter  is  an  optimal  estimator  in  the  case  of  Gaussian 
statistics  and  that  it  is  the  best  linear  estimator  in  the  case  of  other  statistics  [46].  Motivated  by  the  Kalman 
filter  scheme,  we  treat  the  final  consensus  value  as  the  system  state,  which  is  unknown  a  priori  but  is  the  final 
equilibrium  state  that  each  agent  in  the  group  is  expected  to  achieve.  In  the  consensus  problem,  each  agent 
has  an  estimate  of  the  final  consensus  value.  Communication  from  other  agents  regarding  their  estimate 
of  the  final  consensus  value  will  be  regarded  a s  measurement  data.  In  this  sense,  each  agent  in  the  group 
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performs  its  own  estimate  of  the  final  consensus  value  based  on  the  information  available  to  it.  Our  goal  is 
to  guarantee  that  the  information  state  of  each  agent  achieves  the  final  consensus  value.  In  other  words,  the 
objective  is  to  minimize  the  mean  squared  error  between  each  agent’s  estimate  of  the  coordination  variable 
and  the  final  consensus  value.  The  error  covariance  matrix  is  interpreted  as  the  confidence  that  each  agent 
has  in  its  current  estimate  of  the  coordination  variable*  where  large  covariance  indicates  low  confidence,  and 
small  covariance  indicates  a  high  degree  of  confidence.  In  Section  2.  LI  we  will  derive  the  Kalman- consensus 
scheme  for  a  continuous- time  update  scheme,  and  in  Section  2J/2  we  will  address  the  discrete-time  case. 
Analytical  properties  of  the  algorithms  will  be  derived  in  Section  2.2. 

2.1*1  Continuous-time  Consensus 

System  model  and  measurement  model; 

x  —  Ax  +  Bu  +  Gw 
z  —  Hx  +  v 

i(0)  ~  (x0,  P0),  ~  (o,  Q),  v  ~  (0,  R) 

Assumptions: 

{u;(t)}  and  {u(£)}  are  white  noise  processes  uncorrelated  with  x(Q) 
and  with  each  other.  J?  >  0. 

Initialization; 

P(0)  —  Po>£(Q)  =  xq 
Error  covariance  update: 

JP  =  AP  +  PAt  +  GQGt  -  PHtR^HP 

Kalman  gain: 

K  =  PHtR~ 1 
Estimate  update: 

x  —  Ax  +  Bu  +  K  (z  —  Hx) 


Table  I:  Continuous- time  Kalman  filter  [47]. 

The  standard  continuous-time  Kalman  filter  is  summarized  in  Table  2. LI  [47].  The  objective  of  this 
section  is  to  show  how  the  Kalman  filter  equations  can  be  used  to  derive  a  decentralized  information  consensus 
scheme. 

Let  €  Rm  be  the  a  priori  unknown  information  state  over  which  the  team  is  to  form  consensus.  In 
other  words,  each  information  state  will  converge  to  the  consensus  value  as  t  — * ►  co.  Note  that  the 
consensus  value  will  depend  not  only  on  interaction  topologies  but  on  the  weighting  factors  in  the  update 
schemes.  In  this  section  we  will  assume  that  the  consensus  state  is  a  constant,  which  implies  that  the  system 
dynamics  are  given  by 

C  -  ttft 

where,  with  reference  to  Table  2.1.1,  A  =  0,  B  -  0,  G  =  /m,  and  £{wu;T}  —  Q .  In  the  following,  we  assume 
that  <J(f}  >  0  is  uniformly  lower  and  upper  bounded. 

Treat  the  iih  information  state  as  the  ith  agent’s  estimate  of  f  *  and  suppose  that  the  agent 
communicates  to  the  ith  agent  with  transmission,  or  communication  noise  Also,  let  ^j(f)  be  a  time- 
varying  boolean  variable  that  indicates  the  presence  of  an  open  communication  channel  from  agent  j  to 
agent  i  at  time  i.e.,  —  1  if  information  is  communicated  from  j  to  i  at  time  t  and  zero  otherwise. 

Note  that  =  1-  Using  these  definitions,  it  is  clear  that  the  measurement  model  of  the  ith  agent  can  be 
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given  by 


/  9n  (Ci  +  ^il) 


\9iN  (Cjv  4*  ViN)j 


(  9iJ\ 


\9iNlJ 


{*  + 


(  9n  (Cl  ”  C*  +  ^il) 

{9iN  (CiV  -  C*  +^tN)> 


where,  with  reference  to  Table  2,1.1, 


and 


Hf  =  (gnl  *.<  5,n/) 


= 


0ti  (Ci  -  C*  +  ) 


(Cw  -f*  +  ^n)  ■ 


If  we  define  P*  =  P{(£,  -  C*)(Ct  ~C*)T)  and  assume  that  £{(&  -  C*)(Cj  -  C*)T} 


Pi  =  E{Vivf} 

fgniPi  +  ^u) 

V  o 


0  > 
g%w(Pft  +  niN)J 


where  is  assumed  to  be  upper  bounded. 

Therefore,  the  error  covariance  update  in  Table  2. LI  becomes 


Pl^-PiHrfR-lHiPi  +  Q 

N 

=  -Fi$29ij(Pj+Slij)-i)Pi  +  Q. 

J  =  1 


Similarly,  the  Kalman  gain  is  given  by 

Kt  =  PiHlH T1 

-(«ifl(Pi+nii)“l  9inPi(PN+niN)-l)t 
and  the  estimate  update  is  given  by 

ii  -  Kite  -  Hiti) 


Ki 

<  9.i(Si+*'ii}  > 

(  y> l/m  \ 

:  U 

^  9m(C/V  +  ViN)  / 

\  9iN^m  ) 

N 

~  JZ  ”  C»  +  ^ij  )j 


where  K*  =  {KiuKi2r-"  >Kinl 
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=  0,  where  i  ^  j,  then 
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Summarizing,  we  have  the  following  Kalman  consensus  scheme  for  the  ith  agent: 


Pi  =  -Pi 


53  i  W  ( Pj  +  ) 


K  i  =  fl(Pj  +  ny) 


-i 


i  2ij(t)Kij  (fe  +  Vii)  -  6)  • 

j=l 


(8) 

(9) 

(10) 


Note  that  Eq.  (8)  indicates  that  the  certainty  of  information  increases  with  communication  but  decreases 
with  the  size  of  the  process  noise.  In  addition,  the  rate  of  increase  in  certainty  for  the  ith  agent  is  inversely 
proportional  to  the  certainty  of  the  jth  agent  and  the  communication  noise.  Note  also  that  the  Kalman  gain 
Ktj  is  reduced  if  either  the  communication  noise  is  large,  or  if  the  certainty  of  the  jih  agent  is  small  {hence  P7 
large).  Note  that  Eq.  {10}  is  similar  to  the  continuous-time  consensus  schemes  proposed  in  (34,  35T  16]  except 
that  the  consensus  gain  K\j  is  time^ varying  in  (10),  and  the  communication  noise  is  explicitly  included. 


2.1.2  Discrete-time  Consensus 


System  model  and  measurement  model: 
x{k  +  1}  =  A[k]x[k)  +  B[k]u[k]  +  C[k]w[k] 
z  [fc]  —  //[fc]x[fc]  -f 

x(0)  -  (xo>P*a),w  -  (0 ,Q[fc]),  v[fc]  ~  (0,  AM) 

Assumptions: 

[w[k]}  and  {u[fc]J  are  white  noise  processes  uncorrdated  with  x0 
and  with  each  other.  /£[£]  >  0+ 

Initialization: 

p[0]  =  Pxo,xo=x o 

Time  update:  (effect  of  system  dynamics) 

error  covariance:  F[A:  +  lj“  =  j4[fc]-P[/e]AMr  4-  G[&jQ[fc]G[A:]T 
estimte:  x[A:  -|- 1]"^  =  A[A:]x[fc]  +  £[&]u[fc] 

Measurement  update:  (effect  of  measurement  *{&]) 

error  covariance:  P[A:  +  1]  =  [(P[k  +  l]”)_l  +  H[k  +  I]T/?[fc  +  lJ-1//[A:  +  1] J “ 1 

estimate:  x[k  +  1]  =  x[k  +  l|~  +  P[k  +  1) H[k  +  1]tJ£[At  +  1  ]’l[z[k  +  1]  -  H[k  +  l]x[k  +  1]“) 


Table  2:  Discrete- time  Kalman  filter  [47], 

The  standard  discrete-time  Kalman  filter  is  summarized  in  Table  2.1.2  [47].  Again  assuming  that  is 
constant  we  get 

V[k  +  lJ-fM  +  M*]. 

where,  with  reference  to  Table  2.1.1,  A  [A:]  =  /,  i?[Ar]  —  0,  G[Ar]  =  /mt  and  £{tuMtu[fc]r}  =  Q[k]> 

Again  letting  [A-]  represent  the  communication  noise,  the  measurement  model  for  the  itb  agent  can  be 
given  by 
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Zj[Jc]  = 


Ki  ft]  +  *'«[*])  \ 


^s>wft]  (6tftl  +  "ijvft]}y 
Shift]  A 

e*[fc]  + 


/  Sa  ft]  (6 ft]  -  V  ft]  +  "a  ft])  > 
\ffwft]  (fcvftl  -Cft]  +  vnf[k])J 


{giN{k]lJ 

where,  with  reference  to  Table  2.1.1, 

and 

(  fit  1  ft]  (£1  ft]  ~  £*ft]  +  m  ft] ) 

Vi  =  : 

^SiAfft]  (£wft]  -  £*ft]  +  viN[k})j 

If  we  define  4  ft]  =  fi{(6[fc]  -C*W)(6W  -  £*ft])T}  and  assume  that  £{(£#]  -£*ft])(0 ft]  -rft])T}  =  0. 
where  i  ^  jT  then 

P<ft]  =  £{u,ft]tj,ftjr} 


/ftiWtaw+OiiW) 


where  fly  ft]  =  £{i'ijftftijft]r}- 

Therefore,  the  time  update  in  Table  2.1.2  becomes 


fflwl*3(flvt*]+nw[fc]), 


-i 


Pfft+ I] -4ft] +<?[*] 

erft  +  i]  =&[*]■ 

The  measurement  update  is  given  by 

4ft  + 1]  =  [(P-[fc  +  l])-1  +  Hf[k  +  ljfl^ftlft  +  ljifj  [A:  +  1]] 

=  Km  +  Qft])"1  +  f>yft](4ft]  +  [fc])-1]-1. 

3= 1 

(dk + 1] = r  [fc+ ij  +  m + 1  )Hj\k + ijp-'ifc + ii  (zi[k + ij  -  Hi\k  +  ijr  [* + 1]) 

-  m  +  Pi[k  + 1]  ^  [syft](4ft]  +  fly)"1  ft]  <(£,[*]  +  t/yft  +  11)  -  6  ft])]  j 

Summarizing,  we  have  the  following  discrete-time  Kalman  consensus  scheme  for  the  iKh  agent: 

m  + 11  =  [(4ftl  +  Qft])"1  +  X>;ftK4ft]  +  nyfti)"1!"1. 

J=1 

ti[k + ii = m + m + ij  [sij[fci(Pj[&i + slur1®  im + + id  -  6  ft])]  J . 


on 

(12) 


22 


BYU 

2.2  Convergence  Results  MagiCC  Lab 


2*1,3  Meet  for  Dinner  Example 

To  illustrate,  consider  the  meet-for-d inner  problem  discussed  in  the  introduction.  Suppose  that  there  are 
Af  —  10  agents  who  communicate  with  exactly  one  other  individual ,  chosen  randomly  from  the  group,  for  a 
random  length  of  time.  After  the  communication  has  expired ,  the  process  is  repeated*  Figure  11  shows  the 


Figure  11:  Continuous- time  meet- for- dinner  simulation.  The  subplot  in  the  upper  left  shows  the  evolution 
of  the  coordination  variable  assuming  that  all  agents  begin  with  equally  confident  covariance*  The  subplot 
in  the  lower  left  shows  the  associated  covariance.  The  subplots  on  the  right  show  identical  data  where  the 
agent  with  initial  time  &  =  7  has  an  initial  covariance  of  Pi  =0.001. 

state  and  variance  plots  under  the  continuous  Kalman  consensus  scheme  (8)-{lG)  where  the  initial  state  is 
uniformly  assigned.  The  subplots  on  the  left  show  the  arrival  times  and  variance  when  the  initial  variances 
are  uniformly  assigned.  The  subplots  on  the  right  show  the  arrival  times  and  variances  when  the  %Tariance  of 
the  agent  with  initial  arrival  time  £,  =  7  is  given  an  initial  variance  of  Pi  —  0.001,  which  is  significantly  lower 
than  the  other  agents.  Note  that  in  this  case,  the  final  consensus  value  is  influenced  to  a  greater  degree  by 
this  agent.  Figure  12  shows  similar  plots  using  the  discrete  Kalman  consensus  scheme  (11)-{12).  Both  the 
conti nuous-ti me  and  discrete-time  simulations  use  the  values  =0*1,  Q  =  0.1. 


2,2  Convergence  Results 

The  objective  of  this  section  is  to  state  some  technical  properties  of  the  algorithms  given  in  Eqs.  (8)-( 10) 
and  Eqs.  (11}-(12).  For  notational  simplicity,  we  will  focus  on  the  case  where  each  information  state  £* 
is  a  scalar.  The  vector  case  reduces  to  the  scalar  case  if  P{q  is  a  diagonal  matrix.  The  general  case  where 
Pio  is  non -diagonal  is  currently  a  topic  of  research.  In  Section  2*2.1,  we  will  introduce  some  notation  and 
results  From  graph  theory  and  non-negative  matrices  that  will  be  used  in  the  convergence  arguments.  In 
Section  2.2,2  we  analyze  the  conti nuous-time  case  and  in  Section  2.2,3  we  analyze  the  discrete- time  case. 

2,2*1  Preliminaries 

Let  A  —  €  X),  where  2  =  {1,2,  ,7i},  be  a  set  of  n  agents  among  whom  consensus  is  desired.  A 

directed  graph  G  will  be  used  to  model  the  interaction  topology  among  these  agents*  In  the  vertex 
represents  the  tth  agent  A{  and  a  directed  edge  from  Ai  to  Aj  denoted  as  (Ai,Aj)  represents  a  unidirectional 
information  exchange  from  At  to  Aj,  that  is,  agent  j  receives  information  from  agent  i,  (i,  j)  6  2-  If  the 
information  flows  from  agent  i  to  agent  j,  agent  i  is  called  the  parent  of  jt  and  agent  j  is  called  the  child  of  i. 
A  directed  path  in  graph  Q  is  a  sequence  of  edges  (Ailf  A^),  (Ai31Ah),  {Al2,AtJr  in  that  graph.  Graph 
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Figure  12:  Discrete-time  meet- for- dinner  simulation.  The  subplot  in  the  upper  left  shows  the  evolution  of 
the  coordination  variable  assuming  that  all  agents  begin  with  equally  confident  covariance.  The  subplot  in 
tlie  lower  left  shows  the  associated  covariance.  The  subplots  on  the  right  show  identical  data  where  the  agent 
with  initial  time  =  7  has  an  initial  covariance  of  Pi  =  0.00  L 


G  is  called  strongly  connected  if  there  is  a  directed  path  from  A*  to  A3  and  A3  to  A%  between  any  pair  of 
distinct  vertices  Ai  and  Aj,  V(i,  j)  €  X.  A  directed  tree  is  a  directed  graph,  where  every  node,  except  the 
root,  has  exactly  one  parent.  A  spanning  tree  of  a  directed  graph  is  a  directed  tree  formed  by  graph  edges 
that  connect  all  the  vertices  of  the  graph  [48],  We  say  that  a  directed  graph  has  a  spanning  tree  if  there 
exists  a  spanning  tree  that  is  a  subset  of  the  directed  graph.  Fig.  13  shows  a  directed  graph  with  more  than 
one  possible  spanning  trees.  The  double  arrows  denote  one  possible  spanning  tree  with  As  as  the  parent. 
Spanning  trees  with  A \  and  A4  as  the  parent.,  are  also  possible.  As  a  comparison,  Figs.  14  shows  two  cases 
where  the  graph  does  not  have  a  spanning  tree. 


Figure  13:  A  directed  graph  that  has  more  than  one  possible  spanning  trees,  but  is  not  strongly  connected. 
One  possible  spanning  tree  is  denoted  with  double  arrows. 

The  interaction  topology  may  change  dynamically.  Let  Q  =  {Si ,£2,  -  ,G\t}  denote  the  set  of  all 
possible  directed  interaction  graphs  defined  for  A.  It  is  obvious  that  §  has  a  finite  number  of  elements  and 
that  £?(£)  e  Q.  The  union  of  a  set  of  directed  graphs  {Si, ,  Gi2,  *  ■  ■  ,  Gim  }  C  Q  is  a  directed  graph  with  vertices 
given  by  AiT  f  e  Z  and  edge  set  given  by  the  union  of  the  edge  sets  of  ,  j  —  1,  *  ■  ■  We  will  assume 
throughout  the  section  that  the  interaction  topology  does  not  switch  infinitely  fast. 

Let  A/„(R)  represent  the  set  of  all  n  x  n  real  matrices.  Given  a  matrix  A  =  [a,j]  e  Mn(R ),  the  directed 
graph  of  A,  denoted  by  F(A),  is  the  directed  graph  on  n  vertices  i  €  Z,  such  that  there  is  a  directed  edge 
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Figure  14:  (a)  A  directed  graph  that  has  two  leaders,  and  hence  does  not  contain  a  spanning  tree,  (b)  A 
directed  graph  that  has  two  isolated  groups,  and  hence  does  not  contain  a  spanning  tree* 


in  P(A)  from  Vj  to  V*  if  and  only  if  ^  0  [19],  For  example,  the  directed  graph  of  the  matrix 


A  = 


0  0  0  0  ais 

0-21  0  o  0  <225 

0  032  0  0  0 

aAi  0  0  0  0 

0  0  0  a54  0 


where  a ^  ^  0,  corresponds  to  the  graph  in  Fig,  13, 

A  matrix  A  —  [a^]  E  Mn(E)  is  noimegative,  denoted  as  A  >  0,  if  all  its  entries  are  nonnegative. 
Furthermore,  if  all  its  row  sums  are  +1,  A  is  said  to  be  a  (row)  stochastic  matrix  [49],  A  stochastic  matrix 
P  is  called  indecomposable  and  aperiodic  (Sf  A)  if  limn_w  Pn  m  lyTi  where  y  is  a  column  vector,  and  1 
denotes  an  nx  1  column  vector  with  all  the  entries  equal  to  1  [50] .  For  nonnegative  matrices,  A  >  B  implies 
that  A  -  B  is  a  nonnegative  matrix.  It  is  easy  to  verify  that  if  A  >  pB,  for  some  p>  0,  then  the  directed 
graph  of  B  is  a  subset  of  the  directed  graph  of  A. 

Two  72  x  n  non  negative  matrices  are  said  to  be  of  the  same  type  if  their  zero  elements  are  in  the  same 
locations  [50] .  We  will  use  the  notation  P  **  Q  to  denote  that  P  and  Q  are  of  the  same  type. 


Lemma  2,1.  Given  nxn  nonnegative  matrices  Pf  Qf  Rt  and  S,  i/P  ~  R  and  Q  S,  then  ( P+Q )  rvr  (fl+S) 
and  PQ  ^  RS.  Moreover,  if  a  time- varying  nonnegative  matrix  M(t)  with  continuous  entries  is  of  a  fixed 
type  for  t  E  [ti,  *2],  where  ti  <  then  M(t)  ^  f**  M(t)dt. 

Proof;  Trivial,  ■ 

Let  fa  E  R,  i  G  I,  represent  the  ith  information  state  associated  with  the  ith  agent.  The  set  of  agents 
A  is  said  to  achieve  consensus  asymptotically  if  for  any  &(0),  i  E  2,  ^ (£) f[  0  as  t  —  00  For  each 

(*ij)  el. 


2,2.2  Continuous-time  Consensus 

The  following  theorem  is  our  main  technical  result. 

Theorem  2.1*  Given  switching  interaction  topologies  and  zero  transmission  or  communication  noise t  the 
Kalman  consensus  scheme  given  in  Eqs ,  (8) -(10)  achieves  asymptotic  consensus  if  there  exist  infinitely  many 
consecutive  uniformly  bounded  time  intervals  such  that  the  union  of  the  interaction  graph  across  each  interval 
has  a  spanning  tree. 

The  proof  of  this  theorem  depends  upon  the  following  five  lemmas. 

Lemma  2,2.  Let  C(t)  —  (i )]  e  Afn(R)  he  piecewise  continuous,  where  Cij  >  0,  i  /  jf  and  Cij  =s  0.  Let 
®c(t,  to)  be  the  cort'e  spending  transition  matrix.  Then  w  a  stochastic  matrix  with  positive  diagonal 

entries  for  any  t  >t0. 
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Proof:  From  [51]*  we  know  that 
$c(Ma) 

=  /+  /  C{<T\)d<Ji  +  f  C{<Ti)  f  C(a2)da2dfTi  4 - .  (13) 

^0  ■'to  J  Co 

Noting  that  C(£)l  =  0*  where  1  is  a  column  vector  of  ones,  we  can  verify  that  4>c(Mo)l  —  1- 

Note  that  C(£)  can  be  written  as  £(£)  -  ftlnr  where  B(t)  is  a  nonnegative  matrix  and  ft  is  a  constant 
greater  than  maxT6jt0(tj  ma xi€j  \ch(t)\.  It  is  straightforward  to  see  that 

jUc(Mo)  =  <Wc(Mo) 

and 

=  (B(t) -»!„)* B&tje-**-* °> 

and  that  ^c(^o,to)  —  ®n(tQ,  £o)e_^to_t°^  =  /.  Therefore,  we  obtain  From 

Eq.  (13),  it  is  straightforward  to  see  that  <$#(£,  to)  is  nonnegative  and  has  positive  diagonal  entries.  Therefore, 
it  follows  that  fo)  is  nonnegative  and  has  positive  diagonal  entries.  Combining  these  arguments  implies 
that  the  transition  matrix  4>c{t,£o)  is  a  stochastic  matrix  with  positive  diagonal  entries.  ■ 

Lemma  2*3*  Let  C(t)  =  [C|j(£}]  €  Mn(R)  and  C  —  [ctj(£)]  €  Mn(R)  be  continuous  ont  €  [r,  s],  where  s  >  r 
such  that  Cij(t)  >  0  and  ccj(i)  >  0,  Vi  #  j,  and  Cy(t)  =  ££=1  cy{t)  =  0.  Let  Qcfar)  and  $a(s,r) 
6e  £/ie  corresponding  transition  matrices.  A/so  /e£  the  associated  with  C(t)  be  fixed  for  t  6  [t,  s]  and 

suppose  that  C(t)  corresponds  to  the  same  fixed  graph  as  C(t).  Then  the  graph  of  C(t)  is  a  subset  of  the 
graph  o/4>c{s,7-)  and  <&c(s,r)  (s  j  t)  . 

Proof:  Let  C(t)  =  B(t)  -  plnj  where  B(t)  is  a  nonnegative  matrix  and  ft  is  a  constant  greater  than 
max«e|T,a]  maxi€x(c^(i)|.  Following  Lemma  2*2,  we  know  that  4>c{3,t)  =  $e(sf  r)€“^“r>.  Note  that 
the  graphs  associated  with  C*(t)  and  B(t)  are  the  same,  so  are  the  graphs  associated  with  $c(s,t)  and 
Therefore  from  Eq.  (13),  we  can  see  that  4>e(s,r)  >  f*  B{o\)du\ ,  where  f*  B(<Ji)dcri  *v  B(t)  for 
t  €  [r,  s],  or  in  other  words,  the  graph  associated  with  £?(£}  for  t  6  [r,  sj  is  a  subset  of  the  graph  associated 
with  $ff{sTr).  Therefore,  the  graph  associated  with  C{t)  for  t  e  [r,s]  is  a  subset  of  the  graph  associated 
with  $c(*|T)* 

Note  that  $^{s,r}  =  $3(5,  where  C  =  B  —  ftln.  In  order  to  show  that  is  of  the  same 

type  as  we  need  to  show  that  is  of  the  same  type  as  Note  that  B  and  B  are  of  the  same  type 
since  they  correspond  to  the  same  graph.  By  writing  #£  and  as  in  Eq.  (13)  and  comparing  each  term, 
Lemma  2*1  implies  that  each  corresponding  term  is  of  the  same  type,  which  in  turn  implies  that  4>#(s,  r) 
and  (I,  f)  are  of  the  same  type.  ■ 

Lemma  2*4*  Let  Sa  »  {AifA2r*m  yM}  be  a  set  of  stochastic  matrices  with  positive  diagonal  entries ,  // 
the  graph  associated  with  Ax  has  a  spanning  tree ,  then  Ai  is  SI  A.  If  the  union  of  the  graphs  of  matrices  Ai} 

2  =  1,  ■  *  ■  yi,  has  a  spanning  tree ,  then  the  matrix  product  n£=1At  is  SI  A* 

PiYtof:  The  first  statement  is  shown  in  Corollary  3.5  and  Lemma  3.7  in  [38].  For  the  second  statement,  note 
that  the  product  of  stochastic  matrices  is  still  a  stochastic  matrix.  Also  note  that  Ilf=1At  >  Ax  for 

some  7  >  0  according  to  Lemma  2  in  [34]*  Since  the  union  of  the  graphs  of  matrices  in  Sa  has  a  spanning 
tree,  it  is  obvious  that  the  graph  associate  with  1  Ai  has  a  spanning  tree.  Therefore,  it  can  be  seen 
that  the  graph  associated  with  the  matrix  product  has  a  spanning  tree,  which  in  turn  implies,  from  the  first 
statement  of  the  Lemma,  that  the  matrix  product  is  SI  A.  ■ 
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Lemma  2,5-  Let  C(t)  —  [ctj(f)]  €  Afn(R)  be  piecewise  continuous  for  t  €  [r,  s],  where  s  >  r  is  bounded , 
Cij  >  0,  i  ^  jf  and  Ct3  =  0.  If  the  union  of  the  directed  graphs  of  matrix  C(t)  for  t  E  fr,  s]  has  a  spanning 
tree,  then  the  transition  matrix  ^c(str)  ^  SIA. 

Proof:  Note  that  $c(s,r)  =  U-l)  -  •  *  $c(<i»^}t  where  tjt  j  —  I,-**  yt,  denotes  the  times 

when  C(t)  is  discontinuous.  From  Lemma  2,3,  we  know  that  the  graph  associated  with  C(t)  for  each 
t  E  [£»_ i T ft]  is  a  subset  of  the  graph  associated  with  $c(UiU-i)i  *  —  It1”  >£  +  1,  In  other  words,  if  the 
union  of  the  directed  graphs  of  matrix  C{f)  has  a  spanning  tree,  so  does  the  union  of  the  directed  graphs 
of  the  corresponding  transition  matrices.  Also  note  from  Lemma  2.3  that  each  is  a  stochastic 

matrix  with  positive  diagonal  entries.  The  proof  then  follows  from  Lemma  2.4,  ■ 

Before  moving  on,  we  need  the  following  definition  from  [50].  Given  a  stochastic  matrix  5  =  [stj]  E 
A/r,(K.),  define 

A(S)  =  1  - 

* — 

J 

Note  that  A(S)  <  1  for  any  stochastic  matrix  S.  If  A(S)  <  1,  S  is  called  a  scrambling  matrix.  X(S)  =  0  if 
and  only  if  the  rows  of  S  are  identical.  The  introduction  of  A  will  be  useful  for  the  proof  of  Theorem  2.1. 

Lemma  2.6*  (See  [50].)  Let  S  =  {SifSa,  -  -  -  ,5*}  be  a  finite  set  of  SIA  matrices  with  the  property  that  for 
each  sequence  Sit  t  Si^  i *  >  s  ij  of  positive  length ,  the  matrix  product  SijSi.^i  ■■■£*,  is  SIA.  Then  for  each 
infinite  sequence  su,si2,...  there  exists  a  column  vector  v  such  that 

.lim  SiiSii_l--Su=lvT.  (14) 

]—*QQ 

In  addition ,  in  the  case  that  S  is  an  infinite  set ,  A(VF)  <  1,  where  W  =  SkxS^  -  *  ■  and  Nt  is  defined 

as  the  number  of  different  types  of  alln  xn  SIA  matrices.  Furthermore *  tf  there  exists  a  constant  0  <  d  <  1 
satisfying  A  (IF)  <  d,  then  Eq .  (14)  also  holds . 

Proof:  See  Lemma  4  and  the  concluding  remarks  in  [50]*  ■ 

Proof  of  Theorem  2*1: 

From  Eq.  (8),  we  see  that  Pi  >  0  is  uniformly  lower  bounded  since  Qi  is  uniformly  lower  bounded.  Also 
noting  that  <  —Ff/(P- \  +  flu),  we  know  that  Pi  is  uniformly  upper  bounded, 

FVoin  Eq.  (9),  we  can  see  that  Kij(t)  >  0 j,  is  uniformly  lower  and  upper  bounded. 

Let  be  an  infinite  time  sequence  corresponding  to  the  times  at  which  graph  t?(t)  switches 

topology.  Since  the  interaction  topology  cannot  switch  infinitely  fast,  we  assume  that  f*  —  >  ti,y 

Vi  2=  1,2,  ■  ■  ♦ .  Note  that  each  interval  [U-u  U)  can  be  divided  into  finite  or  infinite  number  of  subintervals 
such  that  the  length  of  each  subinterval  is  greater  than  or  equal  to  ti  but  less  than  or  equal  to  Im  =  2ft 
and  the  graph  on  each  subinterval  is  fixed.  Relabel  these  subinteTvals  as  sq,  s\7  *  *  - . 

Without  transmission  or  communication  noise,  Eq,  (10)  can  be  rewritten  in  matrix  form  as  £  =  A (t)f, 
where  £  =  [(),■-  ,£n]r  and  A(t)  =  (Ajj(f)]*  As  mentioned  above,  the  solution  can  be  denoted  as  £(t)  = 
^(t,  Sj)${Sj,  Sj_i)  — '  ^(si ,  so)^(so)t  where  $  is  the  transition  matrix.  Noting  that  Aij(t)  =  gij(t)Kij(t), 
Vj  i,  and  Aij  (f)  =  0,  we  know  that  A(t)  is  continuous  and  satisfies  the  hypothesis  of  Lemma  2,3 
for  t  €  Noting  that  Kifft)  is  uniformly  lower  and  upper  bounded,  we  know  that  each  nonzero, 

that  is,  positive,  entry  A^,  where  i  ^  jf  satisfies  the  property  that  A^  E  [A L,  Aa/]>  which  is  a  compact  set. 
In  addition,  =  Ajj,  which  is  also  in  a  compact  set.  In  the  case  that  the  interaction  topology  is 

switching  with  time,  there  are  a  finite  number  of  possible  interaction  topologies.  For  each  possible  interaction 
topology,  note  that  matrix  A(t)  has  the  same  structure  in  the  sense  that  positive,  zero,  and  negative  entries 
are  in  the  same  places  for  t  E  [sj_i,Sj].  From  Lemma  2.3,  each  transition  matrix  Sj_i)  is  a  stochastic 
matrix,  where  ii  <  Sj  -  8j-\  <  tAf,  and  is  of  constant  type  over  this  interval,  for  each  possible 

interaction  topology.  Combining  the  above  arguments  with  the  fact  that  Sy-i)  is  a  continuous  function 
of  Aij(t)  for  t  €  we  see  that  each  nonzero  entry  of  $(sj,Sj_i)  is  lower  bounded  for  each  possible 

interaction  topology.  It  is  straightforward  to  see  that  there  are  only  finitely  many  types  for  4?(sj,Sj_i}.  We 
know  that  there  exists  a  sequence  of  unions  of  the  directed  interaction  graphs  across  some  time  intervals 
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and  each  union  is  uniformly  bounded  and  has  a  spanning  tree*  Thus  the  transition  matrix  for  each 
union  is  a  product  of  finitely  many  matrices  Ftom  Lemma  2.1,  the  type  of  is  uniquely 

decided  by  the  order  and  type  of  each  element  in  its  product.  Also,  from  Lemma  2,5,  we  know  that  each 
is  SIA.  In  addition,  noting  that  the  graph  associated  with  each  has  a  spanning  tree,  we  sec  that 
any  number  of  products  of  js  also  SIA  according  to  the  second  part  of  Lemma  2.4.  Noting  that  4^*) 
can  only  have  finitely  many  types,  we  see  that  for  each  type  of  its  nonzero  entries  are  lower  bounded* 
Let  W  —  *  * ,  $(j/vt+i)a  From  the  second  part  of  Lemma  2.6,  we  know  that  A(W)  <  I.  Note  that 

W  can  only  have  finite  many  types,  denoted  as  Wt ■  In  order  to  show  that  A(H/)  <  d  <  1,  it  is  sufficient  to 
show  that  for  each  type,  there  exists  a  0  <  d*  <  1  such  that  A(W)  <  d*.  This  can  be  verified  by  noting  that 
the  nonzero  entries  of  W  are  lower  bounded  for  each  type.  Let  d  =  max{di,  efei  *  *  •  *dwt  }•  It  is  obvious  that 
A(IV')  <  d.  From  Leimna  2,6,  we  can  show  that  £(f)  — +  i^T^(0},  where  v  is  a  nonnegative  column  vector,  ■ 
In  previous  results  on  consensus  [34,  38],  the  coefficient  matrix  C(t)  was  assumed  to  be  piecewise  constant 
with  finite  dwell  time,  and  elements  drawn  from  a  finite  set.  The  following  corollary  of  Theorem  2,1  shows 
that  these  conditions  can  be  relaxed. 

Corollary  2.2.  Let  £  =  C(t)£,  where  C(t)  =  [c*j(t)]  €  A/n(R)  is  piecewise  continuous ,  dj  >  0,  i  ^  j, 
Ylj  =  0,  and  each  nonzero  entry  c*j,  i  ^  jt  is  both  uniformly  lower  and  upper  bounded.  Under  switching 
interaction  topologies,  £ »  achieves  consensus  if  there  exist  infinite  many  consecutive  uniformly  bounded  time 
intervals  such  that  the  union  of  the  interaction  graph  across  each  such  interval  has  a  spanning  tree. 


2,2.3  Discrete-time  Consensus 


Theorem  2,3.  Given  switching  interaction  topologies  and  zero  transmission  or  communication  noise,  the 
discrete- time  Kalman  consensus  scheme  listed  in  Eq.  {11)^(12)  achieves  asymptotic  consensus  if  there  exist 
infinitely  many  consecutive  uniformly  bounded  time  intervals  such  that  the  union  of  the  interaction  graph 
across  each  interval  has  a  spanning  tree. 

Proof:  Without  transmission  or  communication  noise,  Eq,  (12)  can  be  written  as 


ft[*  +  l]  = 


1  -  A[*  +  1]  X><#]  (*,[*]  +  tty)”1 

}±i 


&[*] 


+  Pl[k  +  1]  ^  W]  ■ 

ii4i 


(15) 


Note  that  each  weighting  factor  of  £t  is  less  than  or  equal  to  1  and  the  sum  of  the  weighting  factors  of  £t  is 
equal  to  1,  where  £  G  2.  Letting  £  —  [fj,  ■  -  ■  ,$n]r,  we  can  rewrite  Eq.  (15)  as  £[&  -f  1]  —  0[&]£[fc],  where  it 
can  be  verified  that  is  a  stochastic  matrix  with  positive  diagonal  entries.  In  addition,  for  each  possible 
interaction  topology,  /)[&]  is  of  the  same  type  and  its  nonzero  entries  are  lower  bounded. 

We  know  that  there  exists  a  sequence  of  unions  of  the  directed  interaction  graphs  across  some  time 
intervals  and  each  union  is  uniformly  bounded  and  has  a  spanning  tree*  Let  D bJ  be  the  product  of  matrices 
/)[&]  over  the  ith  union.  Note  that  each  is  SIA  from  Lemma  2*4.  As  a  result,  the  proof  follows  the  same 
reasoning  as  the  proof  of  Theorem  2,1  with  playing  the  role  of  &kK  ■ 


2.3  Consensus  Schemes  are  Input-to-State  Stable 

We  are  primarily  interested  in  the  application  of  consensus  algorithms  to  cooperative  control  problems. 
In  this  section  we  will  explore  a  control  architecture  where  a  consensus  algorithm  is  in  cascade  with  a 
coordination  algorithm,  as  shown  in  Figure  15.  Our  purpose  in  this  section  is  to  derive  conditions  on  the 
consensus  and  coordination  algorithms  that  guarantee  that  the  cooperation  objective  is  achieved.  Toward 
that  end,  rewrite  Eq.  (10)  as 

n  n 

Ct  =  y  ]  (£j  —  £i)  +  ^  f  Qij[t)Kijl/jj.  (lb) 

J=1 
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Figure  15:  The  control  architecture  consists  of  a  consensus  algorithm  in  cascade  with  a  coordination  algo* 
rithin.  The  consensus  algorithm  receives  information  from  the  communication  network  to  produce  a  value  of 
the  coordination  variable  & .  The  coordination  algorithm  uses  the  coordination  variable  £,  to  produce  a  com* 
mand  to  the  vehicle  u,.  We  assume  that  identical  consensus  and  coordination  algorithms  are  implemented 
on  each  vehicle. 


Letting  Xij  —  £  -  and  x  —  (xn,  ^i2»  ■  ■  • , z2lj . *  * ,  a;nTl)rT  we  get  the  state-space  model 

x  =  A(t)x  -f  f?(f)t/  (17) 

where  v  is  a  column  vector  created  by  stacking  the  communication  noise  terms  and  the  elements  of  j4(£) 
and  B(t)  are  linear  combinations  of  gijKij(t)  and  can  be  easily  constructed  from  Eq.  (16),  The  vector  x 
represents  the  total  consensus  error. 

Theorem  2,4.  Under  the  hypothesis  of  Theorem  2,1,  the  Kalman  consensus  scheme  given  by  Eqs.  (8),  (9), 
and  (17)  is  input-to-state  stable. 

The  proof  of  this  theorem  requires  the  following  two  lemmas. 

Lemma  2*7.  Under  the  hypothesis  of  Theorem  2A}  if  the  communication  error  v  is  zero,  then  the  consensus 
error  x  is  uniformly  stable :* 

Proof;  Note  that  £(£}  =  $(Mo)£(to)>  where  4>(Mo)  is  a  stochastic  matrix  according  to  Lemma  2.2,  As 
a  result,  we  see  that  the  ith  coordination  variable  £j(t)  is  equal  to  a  weighted  average  of  all  agents’  initial 
coordination  'variables  communicating  with  agent  i.  Since  a  weighted  average  can  never  be  greater  (or 
smaller)  than  any  one  of  the  components  in  the  average,  we  know  that  fr(f)  €  [min,  £j(£q),  maxj  fj(M]  f°r 
all  t  and  i.  Then  it  is  straightforward  to  see  that 

INOIL  £  N<o)[looi  for  t>t0. 


Lemma  2*8*  The  norm  of  B(t)  in  Eq.  (17)  is  bounded L 

Proof;  Since  B(t)  is  composed  of  linear  combinations  of  Kij(t),  if  ||J^j(£)||  is  bounded  for  eacli  (z,  j),  then 
||i?(£)||  will  also  be  bounded,  ||K^|[  was  shown  to  be  bounded  in  the  proof  of  Theorem  2.L  ■ 

Proof  of  Theorem  2.4;  By  Lemma  2,7,  the  Kalman  consensus  error  is  uniformly  stable.  By  Theorem  2,1, 
lift  “  £j||  — *  0  as  £  — ►  oo  for  all  (i, j).  Since  each  element  of  x  — ►  0,  then  ||a:||  -h <■  0  as  t  -+  oo  and  we  conclude 
uniform  asymptotic  stability.  Any  linear  system  that  is  uniformly  asymptotically  stable  is  also  uniformly 
exponentially  stable  [51],  Additionally,  linear  uniformly  exponentially  stable  systems  with  ))*£?(£) ||  <  /3  for 
finite  (3  are  bounded- input  bounded- out  put  stable  [51).  Since  the  Kalman  consensus  error  governed  by 
Eq.  (17)  is  a  linear  uniformly  asymptotically  stable  system  with  ||I?(t)||  bounded,  it  is  ISS.  ■ 
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Corollary  2.5.  If  the  continuous- time  consensus  schemes  presented  in  [16],  [38],  [34],  and  [35]  are  aug¬ 
mented  with  communication  noise,  then  the  representation  of  these  schemes  that  is  equivalent  to  Eq>  (17)  is 
ISS. 

Proof:  The  difference  between  each  of  these  schemes  and  Eq,  10  is  that  the  consensus  gain  A\j(t)  is  time 
invariant.  Therefore,  from  the  proof  of  Theorem  2 A  it  is  clear  that  they  are  ISS.  ■ 

Referring  to  Figure  15  we  see  that  the  combination  of  the  communication  network  and  the  consensus 
scheme  is  an  ISS  system.  It  is  well  known  that  the  cascade  combination  of  two  ISS  systems  is  also  ISS. 
Therefore  if  the  feedback  loop  containing  the  coordination  algorithm  and  the  ith  vehicle  is  ISS  from  the 
consensus  error  to  the  cooperation  objective,  then  the  total  system  will  be  ISS  from  the  communication  noise 
to  the  cooperation  objection.  This  concept  is  shown  schematically  in  Figure  16  and  can  be  summarized  by 
the  following  Theorem. 


Figure  16:  The  distributed  cooperative  control  problem  can  be  thought  of  as  a  cascade  connection  between 
the  consensus  algorithm  and  the  coordination  algorithm.  If  both  are  ISS,  then  the  cascade  system  will  be 


ISS 


Theorem  2.6.  If  a  corisensus  scheme  is  ISS  from  the  communication  noise  to  the  consensus  error  and  a 
coordination  scheme  is  ISS  from  the  consensus  error  to  the  cooperation  objective ,  then  the  cascade  intercon¬ 
nection  of  the  two  (see  Fig.  16)  is  ISS  from  the  communication  noise  to  the  cooperation  objective , 


2  A  Illustrative  Example  -  Distributed  Cooperative  Timing  for  a  Team  of  UAVs 

Suppose  that  a  team  of  UAVs,  flying  at  distinct  altitudes,  is  tasked  to  simultaneously  visit  a  pre- specified 
location.  For  simplicity,  also  assume  that  paths  with  appropriate  velocities  have  been  precomputed  for  each 
UAV  as  shown  in  Figure  17.  Algorithms  that  achieve  this  functionality  are  described  in  [41], 

We  will  also  assume  that  each  UAV  has  autopilot  functionality  that  maintains  the  UAV  on  its  pre-defined 
path,  but  that  the  velocity  along  the  path  can  be  adjusted  to  meet  the  simultaneous  arrival  objective  [17], 
We  will  assume  that  the  velocity  hold  autopilot  has  been  designed  such  that 


Vi  =  Gi  {vf  -  Vi)  (18) 

where 
of  the 

Given 


av  >  0,  u*  is  the  velocity,  and  u f  is  the  commanded  velocity  for  the  ith  UAV.  Let  Lj  denote  the  length 
path  remaining  to  the  target,  then 

Li  =  -Vi. 

Li  and  the  ith  UAV  can  estimate  its  expected  time-of-arrival  as 


Therefore 


Vi  Lj  -  LiVi 
Ti  =  - - s - 

■i  p* 


(*?)■ 
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Figure  17:  Cooperative  timing  scenario  with  five  UAVs, 


The  cooperation  objective  for  this  problem  is  that  each  UAV  arrives  at  its  destination  simultaneously^  be, 
Ti  —  Tj  =  0  for  each  (i,  j).  The  coordination  variable  for  this  problem  is  chosen  as  the  arrival  time.  Therefore 
represents  the  ith  UAVs  understanding  of  the  team  arrival  time.  Letting 

«,■  =Vi  +  —  (7Tj  -  -  1)  (19) 

«iT, 


we  get  that 


Note  that 


Ti  =  “7T*  + 


{Ti  -fj)  =  -7 Ti  +  7?i  +  7Tj  -  7& 

=  —7 -  Tj)  +  'y{£i-£j), 

and  that  the  system  0  —  —7 <j>  +  7 u  is  input-to-state  stable.  In  fact  we  have  that 

|4>(0I  <  e_7<!_(o!0(^o)  +  sup  |u(<j)|  . 

t0<a<t 

Therefore,  from  Theorem  2.6,  the  combination  of  the  consensus  strategy  given  by  Eqs.  (8)-(10)  and  the 
velocity  controller  given  by  Eq,  ( 19)  is  input-to-state  stable  with  the  input  being  communication  noise  and 
the  state  consisting  of  both  the  consensus  discrepancy  £T  —  and  the  UAV  arrival  discrepancy  r*  —  Tj. 

The  cooperative  timing  scenario  was  simulated  with  an  unreliable  switching  communication  topology. 
The  team  is  connected  in  the  graph  shown  in  Fig.  18  where  each  link  is  only  available  70  percent  of  the  time. 
When  an  agent  receives  communication  it  updates  its  estimate  of  £  using  the  Kalman  consensus  scheme  of 
Section  2.L1,  In  between  consensus  updates,  agents  control  their  velocity  using  Equation  (19).  Five  agents 
were  given  a  single  target  at  which  to  arrive  simultaneously.  Fig.  17  shows  the  application  scenario,  where 
each  red  circle  represents  an  agent,  the  blue  circles  represent  threats,  the  blue  square  represents  the  target, 
and  the  green  lines  are  the  waypoint  paths. 

In  the  first  case*  communication  noise  was  set  to  zero  and  each  agent  started  with  approximately  the 
same  confidence  in  its  estimate  of  ^  A  plot  of  £  for  each  vehicle  is  shown  in  Fig.  19(a)  and  Fig.  19(b)  shows 
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5 


Figure  IS:  Union  of  possible  communication  topologies. 


r  for  each  vehicle.  As  can  be  seen,  each  agent  in  the  team  achieves  agreement  using  consensus,  adjusts  its 
velocity  to  match  and  arrives  at  the  target  in  approximately  20  seconds. 


Figure  19:  Cooperative  timing  with  no  communication  noise. 

In  the  second  case,  significant  communication  noise  is  added.  £  is  shown  for  each  vehicle  in  Fig.  20(a) 
and  r  for  each  vehicle  is  shown  in  Fig.  20(b).  As  can  be  seen,  each  agent  in  the  team  achieves  approximate 
agreement  using  consensus  where  the  error  in  agreement  is  due  to  the  communication  noise. 

2.5  Conclusions 

This  section  has  considered  the  problem  of  consensus  seeking  with  relative  uncertainty  in  distributed  multi¬ 
agent  systems.  We  have  proposed  discrete-time  and  continuous-time  Kalman  filter- like  consensus  schemes 
that  are  appropriate  when  different  agents  in  the  group  may  have  different  confidences  about  their  information 
state.  Sufficient  conditions  have  been  shown  for  consensus  seeking  using  the  proposed  consensus  schemes 
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(a)  Estimated  team  time  of  arrival,  £,  for  each  agent 


(b)  Actual  time  of  arrival,  rt  for  each  agent 


Figure  20:  Cooperative  timing  with  significant  communication  noise. 


under  switching  interaction  topologies.  Consensus  schemes  were  shown  to  be  input- to-state  stable  from  the 
communication  noise  to  the  consensus  error.  This  fact  was  exploited  in  an  application  to  a  UAV  distributed 
cooperative  timing  scenario. 
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3  Average  Consensus  and  Message  Passing 

For  teams  of  autonomous  agents,  the  ability  to  cooperate  in  a  decentralized  manner  can  enhance  the  overall 
effectiveness  of  the  team.  Central  to  decentralized  cooperation  is  the  consensus  problem  which  lias  been 
investigated  recently  by  a  number  of  researchers  [34,  19,  35,  36]. 

In  general  terms,  the  consensus  problem  for  a  group  of  agents  is  to  ensure  that  as  time  progresses  each 
agent  approaches  a  consistent  understanding  of  their  shared  information.  In  the  general  problem*  the  value  to 
which  the  team  converges  is  arbitrary,  the  only  requirement  being  that  all  agents  eventually  agree.  Average- 
consensus  problems  add  the  restriction  of  requiring  that  the  final  value  (the  group  decision)  be  the  exact 
average  of  the  agents'  initial  values.  The  average- consensus  problem  is  a  part  of  the  family  of  ^-consensus 
problems  [52]  where  the  value  that  the  team  is  to  converge  is  a  function  y  of  the  initial  values  of  the  team  (e.g. 
max  or  min).  Recently  average-consensus  has  been  used  as  a  basis  for  distributed  Kalman  filters  [53,  54]. 

In  [35]  Glfati-Saber  and  Murray  propose  a  distributed,  linear,  continuous- time  protocol  that  ensures 
that  average-consensus  is  achieved  asymptotically  if  the  interaction  networks  connecting  the  agents  switch 
between  balanced,  strongly  connected  graphs.  This  section  will  extend  those  results  to  the  discrete- time 
domain  as  well  as  relax  the  restriction  of  requiring  the  interaction  topology  to  be  strongly  connected  at  each 
instant.  Our  main  result  will  be  to  show  that  if  the  interaction  topology  at  any  instant  is  balanced  and 
the  union  of  the  network  graph  is  strongly  connected  over  every  interval  T\  then  average-consensus  is  still 
achieved  asymptotically.  Thus,  a  network  may  at  no  instant  be  strongly  connected,  yet  agents  in  a  team  can 
still  achieve  average-consensus. 

This  section  is  outlined  as  follows.  In  Section  3.1  we  introduce  a  formal  definition  of  average- consensus 
as  well  as  notation  that  relates  the  communication  topology  to  consensus  protocols.  Our  main  results 
are  presented  in  Section  3,2,  Section  3.3  investigates  the  practical  issues  in  forming  an  average- consensus 
protocol  in  the  discrete-time  framework  and  proposes  two  such  protocols.  The  notion  of  deadbeat  consensus  is 
introduced  in  Section  3.4  and  Section  3.5  investigates  trade-offs  between  asymptotic  and  finite- time  average- 
consensus  protocols.  Finally,  conclusions  are  offered  in  Section  3.6. 

3.1  Definitions  and  Terminology 

The  information  Row  topology  between  agents  on  a  team  is  most  naturally  represented  as  a  directed  graph. 
For  this  reason,  we  introduce  graph  theoretic  terminology  similar  to  [48]. 

Let  A  —  [fljj]  be  an  n  x  n  non  negative  matrix.  The  underlying  directed  graph  G  associated  with  A  lias 
vertex  set  K(G)  =  and  a  directed  edge  (ivj)  from  node  i  to  j  if  and  only  if  a**  ^  0  (note:  some 

authors  use  the  transpose  of  A ,  i.e.  there  is  a  directed  edge  (i,j)  from  i  to  j  if  and  only  if  ^  0),  As  we 
have  defined  the  relationship  between  a  matrix  and  its  underlying  graph,  the  nodes  sending  information  to 
node  i  can  be  determined  by  the  nonzero  entries  in  rowr  L  Nonzero  entries  in  column  i  indicate  which  nodes 
are  receiving  information  from  node  i.  Note  that  two  matrices  with  nonzero  entries  in  the  same  locations 
have  the  same  underlying  graph.  The  neighbors,  Nj,  of  node  i  are  all  nodes  that  communicate  to  i ,  i.e, 
Ni  =  {j  |  Oij  ^  0}.  By  convention,  we  assume  that  each  node  can  communicate  with  itself,  so  an  >  0  Vi 
and  i  €  Nt, 

The  graphs  associated  with  matrices  can  be  connected  m  a  variety  of  ways.  Connectivity  of  the  network 
can  be  roughly  classified  as  follows: 

*  Fully  Connected:  Each  node  has  as  its  neighbors  all  other  nodes  in  the  network. 

*  Strongly  Connected:  Each  node  has  a  path  that  follows  the  directed  edges  of  the  graph  to  every  other 
node  in  the  network.  A  direct  connection  to  all  other  nodes  is  not  necessary,  but  information  fiow  from 
each  node  must  reach  all  other  nodes. 

*  Spanning  Tree:  At  least  one  node  has  a  path  that  follows  the  directed  edges  of  the  graph  to  every  other 
node  in  the  network. 
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Graphs  can  also  be  connected  over  time  by  considering  the  union  of  the  communication  links  over  an  interval 
of  time  (i.e.  the  union  contains  all  edges  that  were  active  during  that  interval).  A  reversed  graph  is  simply 
a  graph  with  the  direction  of  the  links  reversed.  Note  that  a  reversed  graph  is  associated  with  the  transpose 
of  the  original  matrix. 

Each  node  has  an  associated  value  X*  6  IR  which  represents  the  information  on  which  the  team  must 
come  to  agreement.  The  set  of  nodes  {1, ■  is  said  to  be  in  consensus  if  Xj  —  Xj  for  all  When  each 
Xi  =  ^  ^  -Xj[0]  the  team  is  said  to  have  reached  average-consensus.  A  consensus  protocol  defines  how  a 
node  should  update  is  value  of  x*  based  on  the  values  of  its  neighbors.  The  simplest  scheme  is  to  require 
that  each  node  update  its  value  Xi  to  some  weighted  linear  combination  of  its  neighbors  values. 


zjfc+l]  —  (X\j  Xj  [fc] 

j  GM 

The  dynamics  of  the  information  vector  x  =  {x*,  can  then  be  defined  as 

x[A;+l]  =  A[fc]x[fc] 

where  the  sign  of  each  entry  in  A[fc]  is  given  by  the  communication  topology  at  time  k ,  but  the  value  atj  for 
the  nonzero  elements  is  determined  by  the  protocol. 

Let  ho)  =  A[k]A[k—  1]  ■  ■  ■  A[^0]f  then  at  each  k  the  information  vector  can  be  described  by 

x[k+ 1]  =  $4(fc>0)x[0]* 

Consensus  is  said  to  be  reached  asymptotically  if 

lim  *A(Jfc,0)  =  lyT  (20) 

k-+OG 

where  1  is  the  vector  of  all  ones,  y*  >  0,  and  lTy  =  1.  Notice  that  if  Eq,  (20)  is  satisfied,  then  x  — *  lyTx[0] 
implying  that  each  x*  approaches  the  same  convex  combination  of  the  agents’  initial  values.  Equivalently, 
average-consensus  is  said  to  be  reached  asymptotically  if 

lim  0)  =  -11T.  (21) 

oo  n 

3*2  Average-Consensus  under  Switching  Topologies 

The  results  for  linear  consensus  protocols  under  switching  interaction  topologies  have  been  well  studied  [19, 
36]  with  the  main  result  being  that  the  union  of  the  interaction  graphs  over  every  interval  T  must  contain 
a  spanning  tree  to  reach  consensus.  We  will  draw  similar  conclusions  with  respect  to  average-consensus. 
Theorem  1  develops  the  conditions  for  each  A[k]  that  allows  Eq.  (21)  to  be  satisfied.  This  requires  the 
following  two  Lemmata. 

Lemma  1  {Proposition  1  in  [36]),  Let  x[Ar+l]  —  *4[fc]x[Ar]  where  A[fc]  =  [a^  >  0],  Ylj  =  h  aa  >  0  for  alt 
k ,  and  each  nonzero  entry  is  both  uniformly  upper  and  lower  bounded.  If  there  exists  T  >  0  such  that  for 
evei'y  interval  [fc,  fc+F]  the  union  of  the  interaction  graph  across  the  interval  contains  a  spanning  tree ,  then 
consensus  is  asymptotically  achieved  ( Le .  Eq .  (20)  is  satisfied). 

A  similar  result  is  implicit  in  [55].  Notice  that  each  node  has  the  ability  to  choose  the  weight  associated 
with  the  information  from  each  of  its  neighbors  to  ensure  that  its  row  sums  to  one.  If  the  team  is  connected 
often  enough  (i.e.  has  a  spanning  tree  over  every  interval  of  length  F),  then  Lemma  1  ensures  that  consensus 
is  reached. 

Lemma  1  requires  that  the  row  sums  of  A[k]  be  one  and  that  a  spanning  tree  be  achieved  in  every  interval 
of  length  T  for  consensus  to  be  reached.  Now  consider  the  reversed  dynamics  x[Ar+l]  =  £f[fc]x(A:]  where  each 
column  sum  is  equal  to  one. 
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Lemma  2.  Let  x[A+l]  -  B[A]x[A]  where  £?[£]  =  >  0] ^  Yli^j  =  b  &ti  >  0j  a™d  eac h  nonzero  entry 

btj  is  both  uniformly  upper  and  lower  bounded -  Under  switching  interaction  topologies,  if  there  exists  T  >  0 
such  that  for  every  interval  [fc,  k-\-T]  the  union  of  the  reverse  interaction  graph  across  the  interval  contains 
a  spanning  tree t  then 

lint  Afa)  =  V 1T 

fc~*oo 

uAere  >  0  and  yT  1  =  1 . 

Pnpo/;  If  the  column  sums  of  Z?[A]  are  equal  to  one  and  a  spanning  tree  is  achieved  in  the  reverse  graph, 
then  Br[k j  has  row  sums  of  one  and  a  spanning  tree  is  achieved  in  the  regular  graph.  By  application  of 
Lemma  1  lim*-™  $£r(A,  Ao)  =  l£T,  so 


iimfc 

— *QQ  A) 

[\imk~*ce®B-r(kQlk)]T 

limjt-#oo  &BT(kh  A)T 
lim^oo  $s(A,Ad) 


13/T 

yiT 


T 


The  fact  that  ^aT(A,  fco)  =  lzT  =>  (Afr  A)  —  lyr  can  be  seen  by  noting  that  eacli  i?T[A]  is  row  stochastic 
with  positive  diagonal  entries  and  if  the  product  i?r[A]i?r[A4T]  *  *  ■  Dt[&+T]  contains  a  spanning  tree,  then 
the  product  Dr[A+T]iJr[A+T-“l]  Br[k]  also  contains  a  spanning  tree.  Wolfowitz  [50)  showed  that 
infinite  products  of  SIA  matrices  {a  superset  of  matrices  that  have  a  spanning  tree  and  are  row  stochastic 
with  positive  diagonal  entries)  converge  to  the  form  l^r  in  any  product  order  (however,  the  value  of  y  will 
be  dependent  on  the  actual  order).  ■ 

Theorem  1.  Let  x[/:+l]  =  j4[A]x[A]  where  A  [A]  =  [ctij  >  Gj,  =  b  aij  =  !f  aa  >  0,  and  each 

nonzero  entry  a jj  is  both  uniformly  upper  and  lower  bounded !.  Under  switching  interaction  topologies,  if  there 
exists  T  >  0  such  that  for  every  interval  [fc,A:+T]  the  union  of  the  interaction  graph  across  the  interval  is 
strongly  connected f  then  Eg.  (21)  is  satisfied  and  average- consensus  is  reached  asymptotically. 

Proof:  Since  A[h]  is  strongly  connected  over  each  interval  [A,  A+T],  then  A[k\  has  a  spanning  tree  in  both 
the  regular  graph  and  the  reverse  graph.  Therefore,  the  matrix  A[k]  satisfies  all  the  conditions  in  Lemma  1 
and  Lemma  2.  Consequently, 

lim  #a(A,  A0}  =  1  yT 

k—*OQ 

lim  4>>i(A,  Ao)  —  zlT 

k—*oc 

lim  $^(A,  A o)  —  - 

fc— oo  n 


and 

so 


3.3  Distributed  Protocol 

Careful  examination  of  Theorem  1  will  reveal  that  finding  a  distributed  protocol  to  satisfy  the  hypotheses  of 
the  theorem  will  be  difficult.  Specifically,  at  each  instant  in  time,  the  row  and  column  sums  must  be  equal 
to  one.  In  the  general  consensus  problem,  only  the  row  sums  are  required  to  be  one.  Since  the  neighbors  of 
agent  i  are  determined  completely  by  row  i ,  then  each  agent  simply  chooses  appropriate  weights  for  each  of 
its  neighbors  values  ensuring  that  the  weights  sum  to  one.  In  the  average-consensus  case,  not  only  do  the 
weights  associated  with  the  neighbors  of  i  need  to  sum  to  one,  but  all  nodes  for  which  i  is  a  neighbor  must 
weight  the  information  from  i  such  that  the  column  sum  is  equal  to  one.  This  section  will  investigate  this 
subtlety  and  propose  two  protocols  that  achieve  average-consensus  in  a  distributed  manner. 

To  illustrate  the  difficulty  of  requiring  both  row  and  column  sums  to  be  one,  consider  the  network  topology 
shown  in  Figure  2L  The  matrix 
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Figure  21:  Simple  network  over  which  the  average- consensus  problem  can  be  solved,  but  which  requires 
global  information  to  be  available. 
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has  as  its  underlying  grapli  the  topology  shown  in  Figure  21  and  has  row  and  column  sums  equal  to  one.  If  the 
underlying  network  topology  remains  fixed,  then  by  Theorem  1,  the  system  will  achieve  average- consensus 
asymptotically.  In  one  sense,  the  protocol  is  distributed  since  each  agent  only  uses  the  information  received 
from  its  neighbors;  however,  the  first  agent  weights  all  neighbors1  values  equally,  the  second  agent  weights  its 
own  value  twice  as  much  as  its  neighbors,  and  the  third  agent  weights  its  neighbors  values  twice  as  much  as 
its  own.  In  order  to  determine  the  entries  in  A ,  some  global  knowledge  of  the  network  topology  is  required 
-  Le.  there  is  no  simple  rule  that  an  agent  can  use  to  determine  the  weight  it  gives  to  information  from  its 
neighbors  without  knowledge  of  the  global  topology. 

An  ideal  protocol  would  be  able  to  achieve  average- consensus  without  using  global  information.  We 
will  investigate  two  protocols  that  impose  additional  restrictions  oil  the  types  of  graphs  involved,  but  that 
achieve  average-consensus  without  resorting  to  global  information.  The  first  is  proposed  in  [35 j  and  requires 
the  definition  of  the  graph  Lapiacian.  Let  L  be  defined  element-wise  as 


f .  ,_f  J  ~  ? 

u  \  -a 

where  =  1  if  there  is  a  communication  link  from  node  j  to  node  i  and  ~  0  otherwise  (here  A  =  [q^J 
is  simply  the  adjacency  matrix  of  a  graph  GT).  The  protocol  is  then  defined  in  terms  of  the  Lapiacian 


x[k+l\  =  (I-eL)x[k]  (22) 

where  e  E  {0t  l/max*  («).  Notice  that  the  row  sums  of  L  are  all  zero  by  construction,  so  the  row  sums 
of  A  =  /  —  eL  are  all  one.  If  e  E  (0,1/  max^},  then  A  will  also  be  nonnegative  and  consensus  will  be 
guaranteed  if  the  graph  contains  a  spanning  tree  in  every  interval  T, 

Olfati-Saber  and  Murray  show  in  [35]  that  when  a  graph  is  balanced  then  the  column  sums  of  L  are  zero, 
A  balanced  graph  is  one  in  which  at  each  node  the  out-degree  equals  the  in-degree,  i.e.  each  node  sends 
information  to  as  many  a s  send  information  to  it.  Notice  that  when  G  is  balanced  then  L  has  column  sums 
of  zero,  and  A  has  column  sums  of  one.  So,  by  Theorem  1,  the  protocol  (22)  will  achieve  average-consensus 
if  the  network  switches  between  instantaneously  balanced  networks  which  are  strongly  connected  over  every 
interval  T. 

Protocol  (22)  is  almost  completely  distributed  since  each  node  determines  the  weight  to  associate  with 
information  from  its  neighbors  without  knowledge  of  the  grapli  topology;  however,  all  nodes  must  have  the 
same  value  of  c  whose  upper  limit  is  determined  by  the  connectivity  of  the  graph.  Certainly,  for  fixed  number 
of  agents  n,  c  E  (0, 1/n]  will  ensure  that  A  remain  nomxegative  and  Theorem  1  will  apply.  This  requires 
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a  priori  knowledge  of  the  size  of  the  team,  especially  since  the  larger  the  value  of  e  the  faster  the  rate  of 
convergence  (the  second  eigenvalue  will  be  closer  to  zero,  see  [35]).  Setting  €  =  l/N  where  N  is  an  upper 
bound  on  the  number  of  agents  on  the  team  will  ensure  that  average-consensus  is  achieved  asymptotically. 

To  illustrate  the  applicability  of  this  protocol  consider  a  scenario  where  the  network  topology  switches 
randomly  from  between  the  graphs  in  Figure  22.  The  convergence  for  two  values  of  e  are  shown  in  Figures  23 
and  24,  In  this  example,  the  sum  of  the  initial  conditions  is  one.  Notice  that  with  both  values  of  e  the  value 
to  which  the  system  converges  is  | ,  but  the  larger  value  of  e  gives  faster  convergence. 
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Figure  22:  Example  scenario  where  the  network  topology  switches  randomly  between  these  three  graphs. 


Figure  23:  Protocol  (22)  with  c  =  ~  under  switching  topologies. 

When  every  communication  link  is  bi-directional  (he,  G  is  an  undirected  graph),  then  the  graph  is 
trivially  balanced.  In  this  case,  it  is  possible  to  develop  a  protocol  that  can  be  implemented  without  a  priori 
knowledge  of  team  size.  Assume  that  agents  have  the  ability  to  negotiate  with  each  of  their  neighbors  to 


38 


BYU 

3>3  Distributed  Protocol  Mag icc  Lab 


Figure  24:  Protocol  (22)  with  e  =  ^  under  switching  topologies. 


isolate  the  exchange  of  information  to  just  one  neighbor  at  a  time.  During  this  communication  event,  both 
agents  update  their  values  to  be  the  exact  average  of  the  values  present.  For  a  two-agent  communication 
event  (i,  j),  the  protocol  matrix  A  will  be  the  identity  matrix  with  the  exception  of  —  a ^  -  ajj  — 

Notice  that  each  A[k]  retains  the  characteristic  of  having  row  and  column  sums  equal  to  one.  Essentially,  each 
agent  cycles  through  available  communication  channels  to  isolate  a  single  neighbor  at  a  time  and  effectively 
change  its  in-degree  and  out-degree  to  one  at  each  instant.  If  over  every  interval  T  the  union  of  these 
simple  graphs  is  connected,  then  the  conditions  in  Theorem  1  are  satisfied  and  average-consensus  is  achieved 
asymptotically. 

An  example  of  this  protocol  is  shown  in  Figure  26,  In  this  scenario,  the  agents  are  connected  in  a  static 
graph  of  the  form  {1  «-»  2, 2  3, 3  4}.  The  agents  negotiate  with  their  neighbors  so  that  each  agent  only 

communicates  with  one  other  agent  at  a  time.  For  simulation  purposes,  this  can  be  modeled  as  the  system 
switching  randomly  between  the  graphs  in  Figure  25.  Observe  that  the  final  value  is  the  exact  average  of 
the  agents’  initial  conditions. 

To  summarize,  the  JLaplacian  protocol  of  Eq.  (22)  can  achieve  average- consensus  if  the  interaction  topology 
is  balanced  at  each  instant  and  is  strongly  connected  over  every  interval  T+  It  requires  that  some  knowledge  of 
the  maximum  connectedness  or  maximum  number  of  agents  be  available  a  priori  to  determine  the  parameter 
€.  A  second  protocol  reduces  available  communication  to  allow  only  simple  two-agent  interactions.  This 
requires  bi-directional  communication  between  agents  (more  restrictive  than  balanced)  but  assumes  no  a 
priori  knowledge  of  the  network  topologies  or  team  size. 


39 


BYU 

3.4  Deadbeat  Consensus  MagjCG  Lab 


1 

4 

1 

4 

1 

4 

l 

i 

4 

o 

O 

o 

O 

O 

o 

0 

O 

i  i 

i 

i 

i 

i  < 

1 1 

1 

r 

1 

i  f 

0 

o- 

—*o 

o 

0 

0 

0 

2 

3 

2 

3 

2 

3 

2 

3 

Figure  25:  Example  scenario  where  the  topology  remains  fixed  (a  path),  but  the  agents  negotiate  through 
one  of  the  above  graphs  at.  each  instant. 


Figure  26:  Results  of  simple  two-agent  events  to  achieve  average  consensus. 

3.4  Deadbeat  Consensus 

Discrete-time  systems  can  exhibit  finite-time  convergence  when  the  poles  of  the  system  are  all  at  zero  -  i.c. 
when  a  system 

a:[A;+l]  —  Ase[fc] 

has  nilpotent  matrix  A ,  then  x[k]  —  0  Vfc  >  n,  regardless  of  initial  conditions.  This  notion  of  “deadbeat" 
response  motivates  a  similar  investigation  of  consensus  systems.  This  section  will  consider  the  conditions 
under  which  consensus  can  be  reached  in  finite  time. 

Let  ?  be  a  consensus  protocol;  specifically,  let  the  interaction  matrices  generated  by  V  have  row  sums 
equal  to  one.  Note  that  if  V  solves  either  the  general  consensus  problem  or  the  average-consensus  problem, 
the  row  sums  of  the  interaction  matrices  will  be  one.  At  any  time  k .  the  value  of  the  system  given  the  initial 
conditions  at  k  —  0  is 

x[k+ 1]  =  (A[k]A{k-l}>**A[l\A[0])x[0]. 
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Theorem  2,  Let  V  be  a  consensus  protocol,  if  at  some  instant,  it  the  interaction  topology  is  a  fully  connected 
graph  and  V  yields  the  interaction  matrix  at  that  instant 

A[£)  =  ±11T 
n 

then  the  team  will  be  exactly  in  consensus  for  all  k  >  £, 

Proof:  A  group  of  agents  is  in  consensus  if  x*  =  x3  for  every  pair  (ir  j).  Since 

x[£+l]  =  A[f\x[(\  =  — 

71 

then  each  element  i  of  x[£+l]  is 

x([£+l]= 

j- 1 

Therefore,  the  group  has  readied  consensus  at  time  (£+1).  Because  V  ensures  that  each  interaction  matrix 
after  time  £  has  row  sums  equal  to  one,  then  for  all  k>  t  the  group  remains  in  consensus  (each  node  updates 
to  a  weighted  sum  of  the  same  value). 

To  show  that  the  same  conditions  on  A  lead  to  deadbeat  average- consensus,  notice  that  V  has  row  and 
column  sums  equal  to  one  at  each  instant  so 


n 


n 


i=i  t=i 


for  all  k  >  0.  Recall  that  if  a  matrix  A  iias  row  and  column  sums  equal  to  one,  then  1  is  both  a  left  and 
right  eigenvector  associated  with  eigenvalue  l ,  so 


^x*[fc)  —  lTx[A;] 

i=i 

=*  lTx[fc— 1] 


-  lTx[0] 

=  X>!'  °i- 


Therefore,  for  each  agent  i  at  time  £ 

^ 

i=i  j=i 

which  implies  that  the  group  has  reached  average-consensus*  ■ 

Theorem  2  requires  that  at  some  instant  the  communication  graph  is  fully  connected,  he  every  agent  can 
communicate  with  every  other  agent  and  that  the  interaction  matrix  generated  by  the  consensus  protocol 
yields  A  —  “11T  One  consequence  of  this  is  that  deadbeat  average- consensus  is  much  more  difficult  to 
achieve  than  regular  deadbeat  consensus.  This  is  due  to  the  fact  that  average-consensus  protocols  do  not 
generally  yield  the  proper  interaction  matrix  when  the  communication  graph  is  fully  connected. 
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The  reader  will  note  that  even  when  a  graph  is  fully  connected  neither  protocol  from  Section  3.3  will 
yield  the  proper  interaction  matrix  to  achieve  deadbeat  consensus.  A  consensus  protocol  of  the  form 


E  *#1 


(23) 


jSNt 


will  allow  regular  deadbeat  consensus  since  whenever  the  network  is  fully  connected,  each  agent  updates 
its  value  to  the  average  of  all  the  other  agents.  Unfortunately,  such  a  simple  protocol  does  not  lead  to 
average-consensus  in  the  general  case.  Consider  the  balanced  network  shown  in  Figure  27  with  interaction 
matrix 


3  0  3  0 
2  2  0  2 
0  3  3  0 
0  3  0  3 


which  does  not  have  column  sums  equal  to  one.  In  fact 


lim  Ak 


1 

9 


2  3  2  2 
2  3  2  2 
2  3  2  2 
2  3  2  2 


which  shows  that  the  protocol  defined  by  Eq.  (23)  is  not  an  average-consensus  protocol.  It  is  interesting 


3 


Figure  27:  Balanced  graph  for  which  a  simple  averaging  protocol  does  not  achieve  average-consensus. 

to  note  that  if  the  network  topologies  switch  between  balanced  regular  graphs  (graphs  where  all  nodes  that 
have  any  adjacent  edge  have  the  same  degree),  then  this  protocol  does  achieve  average-consensus  (in  the 
general  and  deadbeat  case). 

3.4-1  Example  Application 

Consider  a  fixed  perimeter  which  is  to  be  monitored  by  a  team  of  N  agents  in  a  distributed  manner  (as 
in  [56]).  Let  the  consensus  variable  in  the  system  be  the  size  of  the  segment  an  agent  is  to  monitor.  The 
initial  state  of  the  system  is  when  the  first  agent  reaches  the  endpoint  of  the  perimeter  and  initializes  its 
consensus  variable  to  the  length  of  the  perimeter.  We  desire  average-consensus  so  that  asymptotically,  each 
agent  monitors  an  equal  part  of  the  perimeter. 

Using  the  protocol  of  Eq.  (23)  and  noticing  that  the  system  will  only  switch  between  balanced  regular 
graphs  (since  agents  meet  along  a  line)  deadbeat  average- consensus  may  be  reached.  Figure  28  shows  a 
scenario  where  at  no  time  are  all  agents  in  communication,  so  average- consensus  is  achieved  asymptotically. 
In  contrast,  Figure  29  shows  a  scenario  where  agents  are  launched  in  close  proximity  and  meet  in  a  fully 
connected  group  near  the  beginning  of  the  mission  achieving  deadbeat  average- consensus. 
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Figure  28:  Perimeter  surveillance  using  average- consensus  to  distribute  the  team  evenly  along  the  perimeter. 
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Figure  29:  Perimeter  surveillance  where  deadbeat  average-consensus  occurs. 
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3,5  Finite- Time  Average- Consensus 

An  average- con  sens  us  protocol  will  invariably  require  a  strongly  connected  network  since  each  agent  must 
be  able  to  influence  the  group  decision  to  reflect  its  initial  condition.  Obviously,  if  each  agent  transmits 
its  initial  condition  to  its  neighbors  and  passes  any  communication  received  from  others  along,  then  if  a 
strongly  connected  network  is  available,  eventually  all  agents  will  have  the  complete  set  of  initial  conditions 
from  which  the  average  can  be  computed  (dearly,  this  is  not  novel;  Lynch  [57]  classifies  such  an  algorithm  as 
trivial).  Using  this  method,  all  agents  will  have  the  information  necessary  to  be  in  average-consensus  after  d 
steps  where  d  is  the  diameter  of  the  graph.  Indeed,  it  may  seem  that  the  restriction  to  a  strongly  connected 
network  eliminates  any  need  for  an  asymptotic  protocol.  This  section  will  investigate  the  trade-offs  between 
an  asymptotic  protocol  (such  as  (22))  and  a  simple  message  passing  protocol. 

The  main  advantage  of  an  asymptotic  consensus  protocol  is  the  small  amount  of  bandwidth  required  - 
each  agent  needs  only  to  send  its  current  value.  Additionally,  there  is  no  need  to  identify  individual  agents 
or  know  the  number  of  agents  in  the  team.  On  the  other  hand,  a  message  passing  protocol  could  keep  track 
of  which  initial  conditions  it  has  sent  to  each  of  its  neighbors  and  effectively  limit  its  bandwidth  to  be  the 
same  as  the  asymptotic  protocol,  relying  instead  on  repeated  interaction  to  transmit  all  initial  conditions.  At 
each  instant,  a  node's  value  would  be  the  sample  average,  i.e,  the  average  of  all  initial  conditions  received  so 
far.  For  large  networks,  however,  the  amount  of  overhead  and  the  complexity  may  be  prohibitive.  Perhaps 
the  main  advantage  of  the  message  passing  protocol  is  the  ability  to  utilize  any  type  of  data  (not  simply 
continuous  real  numbers)  in  any  functional  way,  i.e.  agents  are  not  limited  to  average- consensus  but  they 
can  come  to  agreement  on  any  function  of  the  initial  conditions. 

The  message  passing  protocol  effectively  emulates  a  fully  connected  graph  at  k  =  0  by  transmitting  the 
required  information  incrementally.  The  deadbeat  nature  of  the  protocol  makes  it  attractive,  especially  when 
speed  of  convergence  is  an  issue. 

Ari  asymptotic  protocol  will  be  useful  in  very  large  networks  and  in  situations  when  the  value  at  each  node 
is  driven  by  an  external  source  (agent  values  are  dynamic  rather  than  static)  such  as  distributed  Kalman 
filtering  [53],  In  many  cases,  however,  a  simple  message  passing  scheme  may  be  more  attractive  due  to  its 
deadbeat  nature  and  its  ability  to  handle  any  data  type. 

In  an  effort  to  quantify  the  performance  of  aver  age- consensus  as  compared  to  a  message  passing  protocol, 
we  performed  a  number  of  Monte-Carlo  simulations.  We  varied  team  size  and  available  bandwidth  to 
determine  under  which  circumstances  and  how  quickly  the  average  consensus  and  message  passing  algorithms 
would  converge.  For  different  bandwidth  sizes,  the  message  passing  protocol  picked  a  random  set  of  the 
bandwidth  size  and  passed  those  values  to  its  neighbors  at  every  time  step.  For  example,  in  the  bandwidth 
size  2  case,  the  message  passing  protocol  would  pick  2  random  values  from  its  vector  of  known  team  values 
and  pass  those  to  its  neighbors.  In  the  N  bandwidth  case,  an  agent  would  send  all  values  that  it  has  seen 
to  its  neighbors.  In  every  bandwidth  case,  the  average- consensus  protocol  simply  took  the  average  of  its 
immediate  neighbors  and  passed  only  one  message  to  each  of  its  neighbors. 

Each  simulation  consisted  of  2000  iterations  with  a  fixed  team  size  and  bandwidth.  Table  3  shows  the 
results  of  the  tests  where  each  element  of  the  table  corresponds  to  a  specific  team  size/bandwidth  test.  Each 
cell  contains  the  mean  and  standard  deviation  for  the  number  of  iterations  required  to  converge  to  within 
a  small  amount  of  the  final  consensus  value.  The  top  numbers  in  a  cell  correspond  to  the  message  passing 
protocol  and  the  colored  numbers  correspond  to  the  average  consensus  algorithm.  As  can  be  seen  the  message 
passing  protocol  outperforms  the  average  consensus  when  there  is  unlimited  bandwidth  (i.e,  bandwidth  size 
N).  In  that  case,  every  agent  communicates  to  its  neighbors  all  the  initial  conditions  of  the  team  that  it  has 
seen  previously.  Even  for  a  large  team  size  of  64  agents,  the  message  passing  protocol  converges  an  order  of 
magnitude  faster  than  the  average-consensus  algorithm.  As  the  available  bandwidth  goes  down,  however,  the 
average- consensus  algorithm  shines.  Even  with  a  small  team  size  of  16  agents,  at  bandwidth  of  1  message  the 
average- consensus  algorithm  converges  twice  as  hist  as  the  message  passing  protocol.  We  conclude  that,  for 
equivalent  bandwidths  much  less  than  the  size  of  the  team,  the  average-consensus  algorithm  will  on  average 
converge  faster  than  a  message  passing  protocol.  When  there  is  an  abundance  of  available  bandwidth  on  the 
order  of  the  team  size,  then  the  message  passing  protocol  will  be  the  best  choice. 
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Table  3:  Average  Iterations  to  Consensus 
Bandwidth 


1 

2 

4 

JV 

2 . 

N 

31.69  (17.95) 

15T83“(8.'348)— 

9.078  (4.011) 

15.98  (8.371) 

8.989  (3.92)  | 

109.9  (17.62) 

109.7  (17.83) 

109.7  (18,38) 

111.2  (18.2) 

110.3  (18.24) 

190.2  (91.81) 

”  ¥471  (45786)” 

47.58  (23.05) 

46.58  (22.1) 

27.95  (12.23)  | 

333.4  (44.37) 

333.5  (44.77) 

333.6  (44.51) 

331.4  (43.28) 

334.3  (46.01) 

1052  (350.7) 

542.9  (234.9) 

272.3  (120.8) 

134.4  (59,23) 

75,5  (30.2)  | 

837.4  (115.3) 

837.9  (114) 

841.8  (115.1) 

837.9  (116.5) 

839.4  (114.3) 

3166  (148.6) 

2545  (625.9) 

1393  (556.8) 

340.9  (145.4) 

186.6  (67.65)  j 

1862  (269.9) 

1857  (263.4) 

1853  (260.4) 

1857  (268.7) 

1856  (264.2) 

6400  (0) 

6388  (95.83) 

5447  (1042) 

721.2  (283.3) 

410.6  (131.4)  | 

3796  (517,8) 

3824  (526.9) 

3807  (526.1) 

3811  (506.2) 

3774  (515.3) 

In  practical  scenarios  with  wireless  radio  modems,  it  will  often  be  the  case  that  the  overhead  required  to 
begin  communication  with  an  agent  encourages  larger  communication  packets  (due  to  the  cost  of  switching 
between  neighbors).  If  the  team  is  known  to  be  moderate  sized,  the  savings  in  convergence  time  may 
be  substantia!  by  using  a  message  passing  protocol.  Finally,  we  reiterate  that  consensus  algorithms  can 
only  come  into  agreement  on  continuous  valued  variables;  a  message  passing  scheme  allows  any  function 
of  the  initial  conditions  to  be  used,  including  discrete  functions  such  as  voting  protocols.  For  very  limited 
communication  bandwidth  or  very  large  teams,  consensus  algorithms  are  a  good  choice, 

3.6  Conclusions 

This  section  has  extended  the  average- con  senses  results  of  [35]  to  allow  for  networks  to  switch  between 
instantaneously  balanced  networks  that  are  strongly  connected  over  every  interval  T.  The  discrete-time 
case  has  been  dealt  with  explicitly  and  two  asymptotic  protocols  presented  that  achieve  average- consensus 
under  switching  topologies.  The  notion  of  deadbeat  consensus  was  investigated  with  conclusion  that  general 
consensus  problems  may  best  be  solved  using  a  message  passing  mechanism  rather  than  defining  dynamics 
of  the  information  variable  if  a  strongly  connected  network  can  be  assumed. 
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4  Experiments  in  Cooperative  Timing  for  Miniature  Air  Vehicles 

The  effectiveness  of  unmanned  aerial  vehicles  (UAVs}  in  recent  years  has  increased  dramatically.  This 
increase  is  due  to  advances  in  microcontroller,  battery,  and  sensor  technologies.  As  a  result,  the  use  of 
UAVs,  primarily  in  military  applications,  has  also  increased.  Larger  UAVs,  such  as  Global  Hawk,  Predator, 
and  Hunter,  are  currently  being  used  for  border  patrol,  surveillance,  and  even  target  prosecution.  Smaller 
microcoutollers  and  sensor  packages  have  also  improved  the  capabilities  of  miniature  aerial  vehicles1 ,  and 
significant  resources  have  been  allocated  toward  the  further  development  of  these  technologies  [58].  MAVs 
have  a  number  of  applications  for  which  they  are  especially  well  suited  because  of  their  size  and  cost.  Some 
of  these  applications  include  reconnaissance,  battle  damage  assessment,  and  surveillance.  Although  there 
are  many  uses  for  MAVs  in  commercial  or  civilian  applications,  their  reliability  and  capabilities  will  have  to 
be  improved  in  order  to  fully  transition  into  the  commercial  sector. 

In  recent  years,  cooperative  control  for  teams  of  UAVs  has  been  a  heavily  researched  area.  Chandler,  et  al. 
have  explored  strategies  for  dealing  with  the  complexity  and  coupling  inherent  in  cooperative  autonomous 
UAV  teams  [59].  They  show,  by  computer  simulation,  that  using  network  flow  and  auction  algorithms 
for  task  assignment  yields  improved  team  performance  for  search  missions  with  varying  numbers  of  team 
members  and  tasks  to  be  accomplished. 

Much  work  has  also  been  devoted  to  path  planning  for  multi-agent  teams  in  high-risk  environments. 
Flint,  et  ah  have  developed  a  model  for  task  assignment  and  path  planning  involving  both  a  priori  and 
dynamic  information  about  threats  and  targets  [60].  Simulation  results  in  a  three-dimensional  environment 
illustrate  the  effectiveness  of  this  method.  Maza  and  Ollero  present  a  method  for  cooperatively  searching  an 
area  with  a  heterogeneous  team  of  UAVs  taking  advantage  of  the  capabilities  of  the  individual  agents  in  the 
team  [61].  The  area  to  be  searched  is  divided  into  a  set  of  polygons  and  the  assignments  for  the  searching  of 
these  polygons  is  based  on  which  of  the  UAVs  is  best  suited  for  the  task.  Vincent  and  Rubin  have  proposed 
to  use  swarming  methods  for  cooperatively  searching  an  area  [62].  The  UAVs  fly  in  a  formation  that  utilizes 
the  sensing  capabilities  of  each  team  member  and  easily  adapts  if  any  of  the  members  is  lost  during  the 
mission. 

This  section  extends  work  presented  previously  in  the  area  of  cooperative  timing  missions  utilizing  coordi¬ 
nation  variables  and  functions  to  make  path  planning  and  timing  decisions  [63,  32].  Coordinated  timing  over 
a  known  target  can  be  advantageous  in  a  number  of  scenarios.  One  example  is  the  case  where  a  persistent 
image  of  a  target  from  a  consistent  perspective  is  desired.  If  the  camera  footprint  for  each  MAV  in  a  team 
is  known,  arrival  intervals  for  the  team  can  be  determined  such  that  a  persistent  image  of  the  target  can  be 
obtained.  Another  scenario  is  surveillance  of  a  target  in  a  risky  environment,  A  simultaneous  arrival  of  a 
team  of  MAVs  over  the  target,  could  be  coordinated,  thus  maximizing  the  possibility  for  the  team,  or  at  least 
part  of  the  team,  to  reach  the  target.  An  optimization  algorithm  based  on  minimizing  fuel  consumption  was 
executed  to  determine  paths  and  velocities  for  each  of  the  MAVs. 

Section  4.1  outlines  these  two  scenarios  and  describes  the  issues  involved  in  achieving  cooperation  For  a 
team  of  MAVs.  Section  4.2  discusses  the  technical  approach  taken  to  address  these  issues,  and  Section  4,3 
describes  of  the  experimental  setup  used.  The  experimental  results  in  Section  4.4  are  the  primary  contribution 
of  this  section  and  illustrate  the  feasibility  of  real-time  cooperative  timing  missions  for  MAVs. 

4*1  Problem  Overview 

The  focus  of  this  section  is  experimental  demonstration  of  cooperative  timing  scenarios  for  MAVs,  thereby 
providing  practical  validation  of  our  cooperative  control  approach.  Two  specific  missions  are  considered: 
simultaneous  arrival  and  cooperative  fly- by. 

For  the  simultaneous  arrival  mission,  the  objective  is  for  the  MAVs  composing  the  team  to  arrive  at  the 
target  at  the  same  time  from  different  approach  directions.  The  ability  to  have  multiple  vehicles  arriving  at 
destination  simultaneously  is  useful  for  a  variety  of  tactical  purposes  including  formation  join-up  and  target 

1  We  consider  miniature  aerial  vehicles  to  be  those  with  wingspans  in  the  0.3  m  to  2  in  range  and  micro  aerial  vehicles  to  be 
those  with  wingspans  under  0,3  rn.  Here  the  abbreviation  MAV  denotes  mimoture  aerial  vehicle. 
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attack.  In  an  attacking  situation,  simultaneous  arrival  maximizes  the  element  of  surprise  and  increases  the 
intensity  of  the  attack. 

For  the  cooperative  fly-by  mission,  the  objective  is  for  the  MAVs  to  approach  the  target  along  the  same 
approacli  path  at  specified  timing  intervals.  The  ability  to  have  multiple  vehicles  fly  over  a  target  at  specified 
intervals  is  useful  for  persistent  surveillance  or  for  prosecuting  targets  where  a  sequence  of  tasks  is  required 
(e.g.,  identification,  attack,  battle  damage  assessment).  We  consider  the  scenario  where  the  heading  through 
the  target  is  predetermined  to  be  along  the  imaging  path  so  that  a  persistent  image  of  the  target  from  the 
same  viewing  angle  can  be  obtained. 

Both  missions  have  several  common  elements  in  their  planning  and  execution.  During  the  initial  phase 
of  the  mission,  the  MAVs  are  occupied  with  a  secondary  task  (i.e.  monitoring  secondary  targets,  positioning 
themselves  for  the  cooperative  mission,  etc.).  Upon  command  from  the  operator  at  the  ground  station,  paths 
are  planned  to  satisfy  the  timing  constraints  for  the  mission  and  to  minimize  the  cooperation  objective  (e.g., 
minimize  fuel  cost).  Paths  are  planned  based  on  current  MAV  and  target  locations  and  the  desired  headings 
through  the  target.  Upon  completion  of  the  pass  over  the  target.,  the  MAVs  return  to  the  secondary  tasks 
they  were  occupied  with  during  the  initial  phase  of  the  mission*  Challenges  associated  with  these  missions 
include  real-time  cooperative  path  planning,  reliable  and  timely  communication  of  cooperation  information, 
and  accurate  path  following  with  significant  wind  disturbances. 

The  cooperative  timing  solutions  for  the  two  missions  that  we  will  present  were  obtained  from  a  cooper¬ 
ative  timing  algorithm  that  employed  coordination  variables  and  coordination  functions  to  represent  critical 
timing  information  for  the  MAVs*  The  experimental  results  that  will  be  presented  involve  a  team  of  three 
MAVs  although  the  approach  applies  generally  to  two  or  more  vehicles.  This  cooperative  timing  strategy  is 
outlined  in  the  following  section  together  with  the  patli  planning  and  path  following  approaches  that  were 
used. 


4.2  Technical  Approach 

Cooperative  Timing  Strategy 

We  employ  a  cooperative  timing  strategy  that  is  based  on  the  utilization  of  coordination  variables  and 
coordination  functions.  The  underlying  concept  of  coordination  variables  and  coordination  functions  is  that 
the  minimal  information  essential  to  achieving  a  cooperation  objective  should  be  identified  and  communicated 
among  agents  on  the  team.  Minimizing  the  amount  of  information  being  sent  improves  communication  speed 
and  reliability*  It  also  tends  to  simplify  cooperative  control  calculations  by  reducing  the  problem  to  its 
essential  elements. 

Let  x,  define  the  situation  state  for  the  ith  vehicle  on  a  team.  For  the  cooperation  problems  considered  in 
this  section,  the  situation  state  includes  information  about  the  current  MAV  position,  the  target  position,  the 
desired  heading  through  the  target,  and  the  time  interval  between  MAV  arrivals  (zero  for  the  simultaneous 
arrival  scenario).  Additional  information  about  the  environment  (e.g.,  wind  speed  and  direction)  can  also 
be  included  if  it  is  available.  For  a  given  situation  x^,  the  set  of  feasible  decisions  for  the  ith  vehicle  is 
represented  by  £/*(Xt),  and  u*  €  Ui  is  the  decision  variable  for  the  ilh  vehicle.  The  choice  of  the  decision 
variable  by  each  vehicle  on  the  team  affects  both  the  existence  and  the  value  of  the  cooperation  objective.  In 
the  cooperative  timing  scenarios  considered,  the  decision  variable  consists  of  a  waypoint  path  and  an  average 
velocity* 

The  process  of  cooperation  among  agents  can  be  viewed  as  having  objectives  and  constraints.  For  the 
simultaneous  arrival  scenario,  the  cooperation  constraint  requires  the  MAVs  to  arrive  over  the  target  at  the 
same  time*  For  the  fly-by  scenario,  the  cooperation  constraint  requires  the  MAVs  to  fly  over  the  target 
with  the  second  vehicle  at  a  specified  time  interval  behind  the  first,  and  the  third  vehicle  arriving  at  a 
specified  time  interval  after  the  second.  Cooperation  is  said  to  occur  if  the  cooperation  constraints  are  met. 
In  the  these  scenarios,  the  cooperation  objective  is  to  minimize  the  battery  energy  required  to  complete 
the  mission.  The  contribution  of  a  vehicle  to  the  team  cooperation  objective  is  represented  by  an  influence 
function,  fa  —  In  this  case  the  energy  consumption  for  the  ilh  vehicle  (represented  by  fa)  is  a  function 

of  the  MAV  velocity  and  the  waypoint  path  (represented  by  u,). 
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The  coordination  variable#  represents  the  minimal  amount  information  necessary  to  achieve  cooperation. 
For  the  simultaneous  arrival  problem,  the  coordination  variable  is  the  arrival  time  over  the  target.  For  the 
fly-by  problem,  the  coordination  variable  is  the  arrival  time  of  the  first  MAV  and  the  arrival  order  for  the 
vehicles.  If  every  vehicle  has  knowledge  of  the  value  of  the  coordination  variable  and  responds  accordingly, 
cooperative  behavior  will  be  achieved  by  the  team.  For  a  vehicle,  the  value  of  the  coordination  variable  is 
related  to  the  decision  variable,  9X  —  /*( ut),  and  defines  what  the  vehicle  can  do  to  ensure  that  cooperation 
constraints  are  met.  In  the  scenarios  considered  in  this  section,  the  choice  of  path  and  velocity  determines 
MAV  energy  consumption  and  arrival  time  and  therefore  influences  both  the  cooperation  objective  and  the 
cooperation  constraint.  Under  the  assumptions  that  battery  power  is  proportional  to  aerodynamic  drag  and 
that  the  MAV  flies  at  constant  speed,  the  battery  energy  consumption  is  given  by 

Ji  =  CbViLi 

where  c&  >  0  is  a  constant,  is  the  MAV  velocity,  arid  Li  is  the  length  of  the  waypoint  path* 

For  the  cooperative  fly-by  scenario  with  three  MAVs,  the  cooperation  constraints  can  be  written  as 

F,  =  Tst 

T2  —  Tj  +  As*  (24) 

*3  =  Ta  +  A3j 

where  T*  is  the  arrival  time  of  the  ith  vehicle,  A,  represents  the  interval  between  the  arrival  of  the  first 
and  ith  vehicles,  and  Q  —  Ta  is  the  coordination  variable.  Note  that  for  the  simultaneous  arrival  scenario 
A2  —  A3  —  0. 

Critical  to  the  implementation  of  this  approach  is  the  definition  of  the  coordination  function.  The 
coordination  function  models  a  vehicle's  influence  on  the  cooperation  objective  in  terms  of  what  the  agent 
can  do  to  meet  the  cooperation  constraints.  The  coordination  function,  ^i(^)  is  derived  from  the  influence 
function  and  coordination  variable  as 

4>i  =  Ji(ui) 

-  Mf!  m 
=  &(0«), 

where  //  is  the  pseudoinverse  of  fc.  Typically,  /;  is  not  a  one-to-one  mapping  in  that  numerous  values 
of  the  decision  variable  (e,g,,  waypoint  paths,  velocities)  can  result  in  a  single  value  of  the  coordination 
variable  (e.g,,  time  over  target).  The  pseudoinverse  of  is  found  by  taking  the  value  of  decision  variable 
that  minimizes  the  influence  function  Ji ,  thus  creating  a  one-to-one  map  between  0i  and  u.  In  other  words, 
when  multiple  options  exist  to  get  the  same  result,  the  lowest-cost  option  is  chosen. 

Using  coordination  variables  and  coordination  functions,  the  battery  energy  required  to  complete  the 
mission  can  be  minimized  by  solving  the  optimization  problem 

9*  =  argmin  [fa  (#)  +  fa(9  +  A2)  +  +  A3)] , 

which  implies  that  the  constraint  in  Equation  (24)  is  satisfied.  Once  a  team  optimal  value  for  the  coordination 
variable  is  found,  vehicle  decisions  can  be  found  from  the  relationship 

u 

Figure  30  illustrates  the  coordination  functions  calculated  for  a  cooperative  timing  mission  based  on  input 
information  provided  to  the  algorithm.  Because  the  coordination  function  for  each  UAV  is  monotonically 
decreasing,  the  team-optimal  arrival  time  for  one  of  the  vehicles  will  always  lie  at  the  right  extreme  of  its 
coordination  function.  For  the  simultaneous  arrival  scenario,  the  solution  is  simply  determined  by  finding 
the  largest  arrival  time  that  all  three  MAVs  have  in  common.  For  the  fly-by  scenario,  determining  the 
minimum-energy  arrival  time  and  order  for  the  team  involves  a  simple  search  through  the  right  extreme 
(minimum)  values  of  each  coordination  function  with  an  evaluation  of  the  team  objective  to  determine  the 
best  values. [32] 
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arrival  time  it} 

Figure  30;  Cooperative  timing  coordination  functions. 


Path  Planning 

In  order  to  simplify  the  path  planning  for  this  initial  implementation,  some  assumptions  about  the  environ¬ 
ment  were  made.  The  first  assumption  was  that  there  were  no  threats  to  be  avoided*  The  next  assumption 
was  that  both  the  target  location  and  the  desired  fly-by  heading  were  known,  which  eliminated  any  ini¬ 
tial  target  observation  and  detection  steps  prior  to  the  cooperative  timing  mission.  Because  there  were  no 
threats  to  avoid,  a  straightforward  heuristic  approach  for  planning  straight-line  waypoint  paths  was  utilized. 
Straight-line  paths  were  easy  to  plan  and  simplified  both  the  distance  calculations  and  timing  optimization. 

Figure  31  shows  an  example  of  the  waypoint  paths  used  in  the  experiments.  Waypoint  1  for  each  agent 
was  calculated  to  be  100  m  along  the  same  heading  that  the  agent  had  when  the  coordination  algorithm 
was  executed.  Waypoint  2  was  set  to  be  a  fixed  distance  from  the  known  target  location  along  the  desired 
approach  path*  For  the  experiments,  this  distance  was  between  200  ni  and  300  m.  Allowing  this  much 
distance  ensured  that  the  transient  path  following  errors  associated  with  transitioning  from  one  straight 
line  path  to  another  would  be  damped  out  and  that  the  MAVs  would  be  flying  along  the  desired  approach 
heading  when  they  passed  over  the  target.  Waypoint  3  was  the  target  location.  Waypoint  4  was  coll  inear 
with  waypoints  2  and  3  on  the  opposite  side  of  the  target  from  waypoint  2.  Waypoints  2,  3,  and  4  were  the 
same  for  each  agent  in  the  fly-by  scenario  and  only  waypoint  3  was  the  same  in  the  simultaneous  arrival 
scenario.  These  three  points  are  referred  to  as  fly-through  points.  Waypoint  5  was  a  waypoint  to  help  the 
agents  distance  themselves  from  the  target  before  returning  to  their  final  loiter  positions,  waypoint  6. 

In  some  cases,  depending  on  the  initial  MAV  positions  and  headings,  the  coordination  functions  do  not 
yield  a  solution  that  satisfies  the  cooperative  timing  constraint.  For  example,  a  cooperative  fly-by  with 
a  spacing  interval  of  10  s  between  vehicles  could  not  be  accomplished  with  the  coordination  functions  of 
Figure  30  due  to  the  time  gap  between  the  coordination  function  of  UAV  1  and  those  of  UAVs  2  and  3,  In 
these  instances,  it  became  necessary  to  develop  some  simple  approaches  for  adding  length  to  one  or  more 
paths  so  that  the  timing  constraints  could  be  met. 

Based  on  the  coordination  functions,  the  length  to  be  added  to  a  path  was  calculated  to  ensure  constraint 
feasibility.  If  the  length  that  needed  to  be  added  was  less  than  150  m,  the  location  of  the  first  waypoint  was 
adjusted  using  the  law  of  cosines  so  that  the  total  distance  would  be  the  original  length  plus  the  necessary 
additional  length.  If  the  additional  length  was  longer  than  150  m,  two  additional  waypoints  were  added  to  the 
path  immediately  following  the  first  waypoint.  The  second  waypoint  was  calculated  so  that  its  distance  from 
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east  (m) 

Figure  31:  Waypoint  path  example. 


the  first  waypoint  was  half  of  the  required  additional  distance,  and  the  waypoint  was  placed  in  a  direction 
directly  opposite  the  target.  The  third  waypoint  was  placed  at  the  same  location  as  the  first,  thus  effectively 
adding  the  required  length  to  the  path.  The  same  path  shown  in  Figure  31  with  these  added  points  is  shown 
in  Figure  32. 

Figure  33  shows  how  the  coordination  function  for  UAV  3  was  modified  by  adding  length  to  the  original 
path.  With  the  added  length,  the  coordination  function  for  UAV  3  shifted  up  and  to  the  right  allowing  the 
10  s  interval  timing  constraint  to  be  satisfied. 

Path  Following  and  Velocity  Control 

For  timing  missions,  preserving  path  length  is  critical  in  order  to  be  able  to  satisfy  the  timing  constraints. 
Flying  extra  distance  due  to  poor  path  following  adds  unwanted  time  to  the  mission.  To  address  this  issue, 
a  vector  field  path  following  method  has  been  implemented  to  enable  the  MAVs  to  accurately  follow  the 
planned  waypoint  paths.  It  has  been  shown  in  References  [64]  and  [65]  that  creating  vector  fields  of  desired 
heading  in  order  to  direct  the  MAV  onto  the  path  will  result  in  asymptotically  decaying  tracking  error 
provided  ground  track  heading  and  ground  speed  are  used  in  the  control  law.  Use  of  this  method  decreased 
path  following  error,  and  consequently,  overall  path  length  error. 

Since  wind  speed  is  typically  10  to  50  percent  of  the  airspeed  for  a  MAV,  wind  can  have  adverse  effects 
on  path  following  and  timing  constraints.  Typically,  throttle  is  controlled  based  on  a  desired  airspeed.  This 
creates  problems  when  trying  to  fly  timing  missions  in  the  presence  of  wind.  To  address  this  issue,  the 
throttle  was  controlled  based  on  ground  speed  with  a  check  to  make  sure  that  the  airspeed  did  not  drop 
below  the  MAV  stall  speed.  Throttle  control  based  on  ground  speed  also  helps  reduce  the  influence  of  path 
length  error  on  timing. 
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Figure  32:  Waypoint  path  example  with  added  length. 


Figure  33:  Cooperative  timing  coordination  functions  with  length  added  for  UAV 
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4.3  Experimental  Setup 
Hardware  Description 

BYU  has  developed  a  reliable  and  robust  experimental  platform  to  flight  test  algorithms  for  MAVs  [17,  GO] . 
Figure  34  shows  the  key  elements  of  the  testbed.  The  first  frame  shows  BYU’s  Kestrel  autopilot  which 
is  equipped  with  a  Rabbit  3400  29  MHz  processor,  rate  gyros,  accelerometers,  absolute  and  differential 
pressure  sensors.  The  autopilot  measures  3.8x54x1.9  cm  and  weighs  18  g.  The  second  frame  in  Figure  34 
shows  the  airframes  used  for  the  flight  tests  reported  in  this  section.  The  airframe  is  a  1 A  m  wingspan 
Unicorn  EPP  foam  flying  wing,  which  was  selected  For  its  durability,  ease  of  component  installation,  and 
flight  characteristics.  Embedded  in  the  airframe  are  the  Kestrel  autopilot,  batteries,  a  1000  mW,  900  MHz 
radio  modem,  a  GPS  receiver,  a  video  transmitter,  and  a  small  analog  camera.  The  third  frame  in  Figure  34 
shows  the  ground  station  components.  A  laptop  runs  the  Virtual  Cockpit  software  that  interfaces  through  a 
communication  box  to  the  MAVs.  An  RC  transmitter  is  used  as  a  stand-by  fail-safe  mechanism  to  facilitate 
safe  operations. 


Figure  34:  (a)  Kestrel  autopilot,  (b)  Unicorn  airframes,  (c)  ground  station  components. 


Cooperative  Control  Algorithm 

The  cooperative  timing  algorithm  was  implemented  as  part  of  the  Virtual  Cockpit  ground  station  software. 
The  current  implementation  is  centralized,  in  that  the  ground  station  acquired  all  the  information  required  to 
do  the  cooperative  timing  calculations  from  the  vehicles,  performed  the  calculations,  and  sent  only  the  paths 
to  be  flown  and  desired  arrival  times  back  up  to  the  MAVs.  Execution  of  the  algorithm  took  approximately 
1  s,  with  a  significant  portion  of  the  time  being  consumed  by  communication  between  the  MAVs  and  the 
ground  station.  Once  the  MAVs  received  the  paths  and  arrival  times,  the  autopilot  controlled  the  heading 
using  a  vector  field  method  [64]  and  the  throttle  based  on  ground  speed  so  that  cooperative  timing  could  be 
achieved.  The  algorithm  used  in  this  section  is  shown  in  Table  4. 


Table  4:  Cooperative  timing  algorithm. 

1  Acquire  GPS  locations  and  headings  for  each  MAY 

2  Calculate  waypoint  paths  for  each  MAV 

3  Compute  range  of  arrival  times  for  each  MAV 

4  Compute  coordination  function  for  each  MAV 

5  Based  on  coordination  functions,  choose  team  optimal  coordination 
variable  (arrival  time  of  first  MAV  and  arrival  order  for  fly-by  and 
arrival  time  for  simultaneous  arrival) 

6  Calculate  desired  arrival  time  for  each  MAV 

7  Send  waypoint  paths  and  desired  arrival  times  to  each  MAV 

8  MAVs  fly  waypoint  paths  controlling  velocities  to  achieve  cooperation 
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4.4  Results  and  Discussion 

Simultaneous  Arrival 

Consecutive  simultaneous  arrival  runs  were  flown  during  a  25-minute  flight  test.  Telemetry  data  from  one 
of  the  runs  is  shown  in  Figure  35.  Wind  was  between  30  and  60  percent  of  the  MAV  airspeed  during  each  of 
tiie  four  runs.  The  average  time  between  the  arrival  of  the  first  MAV  at  the  target  and  the  arrival  of  the  final 
MAV  was  1 .6  s.  Range- to- target  data  is  displayed  in  Figure  36.  This  illustrates  the  near  simultaneous  arrival 
of  the  MAVs  at  the  target  enabled  by  the  cooperative  timing  algorithm.  Video  footage  of  a  simultaneous 
arrival  mission  can  be  viewed  by  clicking  her**. 


Figure  35:  Simultaneous  arrival  telemetry. 


Cooperative  Fly-by 

The  fly  by  scenario  was  flown  six  times  during  a  half-hour  flight  involving  three  MAVs.  The  wind  speeds 
during  the  flight  were  between  30  and  60  percent  of  the  MAV  airspeed  and  were  from  the  southwest.  The 
position  telemetry  from  one  of  the  fly-bys  is  plotted  in  Figure  37.  The  desired  arrival  intervals  for  this  run 
were  3  s  for  both  intervals  and  the  actual  arrival  intervals  were  3  and  4  s,  respectively.  The  average  error  in 
arrival  time  for  the  six  runs,  in  spite  of  the  high  wind  conditions,  was  approximately  0.6  s.  The  range  to  the 
target  versus  time  for  each  of  the  MAVs  is  shown  in  Figure  38.  The  rise  in  the  range  curves  results  from  the 
overshoot  that  occurs  after  the  first  of  the  fly- through  waypoints.  This  illustrates  the  arrival  of  the  MAVs  at 
approximately  3  s  intervals.  Video  footage  of  a  cooperative  fly-by  mission  can  be  viewed  by  clicking  h  r  . 
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Figure  36:  Simultaneous  arrival  range  to  target. 

Figure  39  illustrates  a  scenario  where  the  length  of  the  path  of  one  of  the  MAVs  had  to  be  lengthened  to 
satisfy  the  timing  constraint.  The  MAY  that  began  in  the  bottom  of  the  frame  was  considerably  closer  to 
the  target  when  the  fly-by  command  was  issued  and  its  path  length  had  to  be  increased  for  cooperation  to 


Figure  37:  Fly-by  telemetry,  3  s  arrival  intervals. 
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Figure  38:  Fly-by  range  to  target,  3  s  arrival  intervals. 


occur.  Two  waypoints  were  added  after  the  first  waypoint  using  the  method  described  in  Section  4.2.  The 
first  two  MAVs  arrived  at  the  desired  arrival  times  while  the  third  MAV  was  delayed  slightly  due  to  the 
strong  head  wind  it  faced  during  most  of  the  run. 


east  (m) 


east  (m) 


t(m) 


east  (m) 


oast  (m) 


east  (mj- 


Figure  39:  Fly-by  telemetry  with  path  lengthened  for  MAV  L 

Although  the  timing  errors  for  both  simultaneous  arrival  and  cooperative  fly-by  arc  relatively  small, 
there  are  a  few  possible  explanations  for  the  errors  that  occurred.  One  of  the  more  obvious  reasons  wras  the 
presence  of  the  high  relative  wind  speeds.  If  the  desired  groundspeed  was  in  the  upper  end  of  the  feasible 
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airspeed  envelope  (20  m/s)  while  flying  into  a  headwind,  it  was  difficult  for  timing  constraints  to  be  met. 
Another  issue  was  synchronizing  the  timing  on  each  of  the  autopilots.  Each  autopilot  had  its  own  timer 
that  started  when  the  list  of  waypoints  finished  uploading  from  the  ground.  This  took  about  1  s  for  each  of 
the  three  agents  and  if  it  took  longer  for  one  MAV  to  finish  downloading  the  information  than  another,  the 
start  times  were  different.  Transient  behavior  involved  in  matching  the  desired  groundspeed  as  it  changed 
and  GPS  error  could  also  have  influenced  proper  timing. 

4*5  Conclusion 

Cooperative  timing  missions  for  miniature  aerial  vehicles  have  been  explored  in  this  section.  The  experi¬ 
mental  results  show  the  feasibility  of  real-time  coordination  for  teams  of  MAVs.  Three  MAVs  were  flown  in 
two  different  flights  and  multiple  cooperative  timing  missions  were  run  during  each  flight.  Both  coordinated 
fly-by  scenarios  and  simultaneous  arrival  scenarios  were  run  where  the  coordination  occurred  on  the  ground 
station  using  an  algorithm  based  on  coordination  functions  and  coordination  variables.  These  results  illus¬ 
trate  the  repeatability  and  reliability  of  this  algorithm  even  in  the  presence  of  high  winds  relative  to  MAV 
airspeed. 
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